Abstract


Music Notation by Computer
Donald Alvin Byrd 


Musicians made their first attempts to use computers for musical purposes about 25
years ago. A major problem that confronted them was that neither musical sound nor the
standard representation of music — conventional music notation (henceforth called
“CMN”) — could be either input or output by existing computer systems. Since then all
four aspects of the musicians' problem — sound input, sound output, CMN input, and
CMN output — have been attacked, with varying degrees of success. This project is
concerned with one of these aspects, namely CMN output.

Producing CMN output is most generally viewed as having three aspects: selecting the
symbols to print, positioning them, i.e., deciding where to print them, and actually printing 
them. The positioning aspect is the core of the problem: it is essentially a question of for- 
matting, analogous to formatting natural language text, but much more difficult. A solu- 
tion to this aspect of the musicians’ problem, that of automatic music formatting, would 
not only be very valuable to musicians, but would also be quite interesting and nontrivial 
from the computer scientist's viewpoint. 

I argue, and give examples by major composers to show, that “fully automatic
high-quality music notation” is not merely nontrivial but in general impossible without
human-level intelligence. Since current work in artificial intelligence is far from this level, one must
compromise, and three ways are possible: compromise in degree of automation, in quality of
output, or in generality of input. Most workers on the problem have primarily sacrificed
automation. The present work, in contrast, primarily sacrifices generality of input, partly
due to historical accident but partly on philosophical grounds. A major chapter of the
dissertation documents my program as so far implemented. The dissertation concludes with
a discussion of the central unsolved problems of automatic music formatting.
Preface


I have just a few general comments to make here. 

First, even though this dissertation is “in” computer science, its content deals to a great
extent with musical matters, and I want to make clear how seriously I take the musical
content. In the Preface to The Art of Computer Programming Donald Knuth remarks: “Since I
myself profess to be a mathematician, it is my duty to maintain mathematical integrity as
well as I can.” I profess to be a musician, and I consider it my duty to maintain musical
integrity, not “as well as I can”, but simply and without qualification “well”.

Second, since this dissertation is concerned with formatting and printing, readers may be 
especially interested in how it was itself prepared, both music and text. The text was pro- 
duced entirely on a VAX 11/780 under Berkeley UNIX. It was entered and edited with 
EMACS; set in Computer Modern with ITROFF, a version of the well-known TROFF
typesetting program; and printed on a Canon/Imagen LBP-10 laser printer with 240 points
per inch resolution. The examples of mathematical notation in Chapter 2 were also set by
ITROFF, aided by the EQN mathematics preprocessor. In Chapter 5, Fig. 2 was set by
ITROFF, and Fig. 4 is from G. M. Hunter’s dissertation [HUNT78]. The musical examples
were prepared in several different ways. Some were set by my MUSTRAN/SMUT music
printing system running on a CDC 6600 and printed on a Zeta 3653SX pen plotter or
Versatec 1200 electrostatic plotter; the captions of figures set this way explicitly identify them as
such. The others were either reproduced from published editions or hand-drawn, and the
reader should have no difficulty telling which are which. All other figures were drawn by
me. Figure captions and text within figures were prepared in several different ways, which I
will not go into.

The body of this dissertation was completed over a year ago, in mid-1983. I have added
a postscript to cover developments through mid-1984.
It is probably pointless to carry out the listing of these horrors any further. By now the reader can
likely begin to construct his own examples of barely manageable metric complexity. It should
certainly be clear that if there is any sort of general solution it is not one that jumps forth and makes
itself unmistakably known.

David Gomberg, A Computer-Oriented System for Music
Printing (ScD. dissertation, Washington University, 
1975), p. 64 

The interesting point to notice is the admirable illustration which this [Arabic] numeral system
affords of the enormous importance of good notation. By relieving the brain of all unnecessary work,
a good notation sets it free to concentrate on more advanced problems, and in effect increases the
mental power of the race. Before the introduction of the Arabic notation, multiplication was difficult,
and the division even of integers called into play the highest mathematical faculties. Probably
nothing in the modern world would have more astonished a Greek mathematician than to learn that,
under the influence of compulsory education, a large proportion of the population of Western Europe
could perform the operation of division for the largest numbers.

A. N. Whitehead, An Introduction to Mathematics
(Oxford, 1911) 

Standard music notation offers a familiar and at the same time a remarkable case [of a notation
system actually used in the arts]. It is at once complex, serviceable, and — like Arabic numerical
notation — common to the users of many different verbal languages. No alternative has gained any 
currency; and apparently no other culture, such as the Chinese or the Indian, has developed any 
comparably effective music notation over the centuries. 

Nelson Goodman, Languages of Art (Hackett, 1976), p.

m 

Computer music systems must continue to evolve in order for their developers to incorporate all of
the refinements that will make them truly efficient and “friendly” tools for musicians. After all, the
violin evolved over centuries and the acoustic piano took generations to reach its present musical
excellence. We have no reason to believe that the ultimate computer music system, whether it costs
$1000 or $50,000, is close at hand.

Robert A. Moog, “The Soundchaser Computer Music
System”, Byte 7,12 (December 1982), p. 276

The literary arts use a basic set of symbols, the Roman alphabet, which has taken some 3,000 years 
to develop and has ousted several elaborate systems along the way. The musical arts use an entirely 
different system, staff notation; and though its period of continuous development has been 
shorter — perhaps no more than 1,500 years — its task is very much greater.

Thurston Dart, The Interpretation of Music (Harper and
Row, 1963), p. 11 

By now it is surely evident that the notator must approach his decisions with a flexible mind.

Gardner Read, Music Notation, 2nd ed. (Crescendo,
1969), p. 86



1 

Introduction 


1.1. THE MUSIC SETTING PROBLEM 

Musicians made their first attempts to use computers for musical purposes about 
25 years ago.[1] A major problem that confronted them was that neither musical
sound nor the standard representation of music, conventional music notation
(henceforth called “CMN”), could be either input or output by existing computer systems.
Since then all four aspects of the musicians’ problem — sound input, sound output, 
CMN input, and CMN output — have been attacked, with varying degrees of suc- 
cess. This project is concerned with one of these aspects, namely CMN output. 

Producing CMN output is most generally viewed as having three aspects: selecting
the symbols to print, positioning them, i.e., deciding where to print them, and
actually printing them. The positioning aspect is the core of the problem: it is essen- 
tially a question of formatting, analogous to formatting natural language text but 
much more difficult. A solution to this aspect of the musicians’ problem, that of 
automatic music formatting, would not only be very valuable to musicians, but 
would also be quite interesting and nontrivial from the computer scientist’s 
viewpoint.[2] It is so far from trivial, in fact, that most workers on the problem have 
abandoned totally automatic formatting and developed systems with important 
interactive components. Such an approach has substantial advantages quite apart 
from sidestepping intractable problems. The approach has been very successful, but 
has led to interesting and difficult questions of user interface design. The present
approach, in contrast, is highly automatic, partly due to historical accident but 
partly on philosophical grounds. 

1.2. TERMINOLOGY 

We will need a small amount of terminology before going on. The first term that 
needs clarification, in fact, is in the title of this dissertation. “Music Notation by 
Computer” is a somewhat misleading or, at best, ambiguous title. I might have said 
“Music Printing by Computer”, but “printing” is not really the right term, since it
suggests reproduction processes as well as the process of preparing the master copy.
To clarify the problem and the usage I will adopt, I cannot do better than to quote
the one previous computer science dissertation on music notation by computer
[GOMB75, pp. 1-2]:

The term music printing encompasses several production processes. An engraver (or 
other craftsman) produces an original image of the eventual printed page. To 
preserve the original image, it is replicated several times and the actual printed
impressions are made from the replicas. There is no simple generic term to describe the
production of the original image because the method of production usually lends its 
name to the process. The terms set and setting in conjunction with printing general- 
ly imply the use of type slugs, and although there have been some attempts to use 
typesetting for music, these terms will be adopted here to describe the production of 
the original image regardless of the method of production. Music setting then is to
be understood as the process of placing music symbols in their required places. 
Music printing will refer to the entire set of processes involved in producing a 
finished piece of printed music. 

Strictly speaking, then, my title should be “Music Setting by Computer”, but the 
phrase “music setting" is so little used that it might have caused worse misunder- 
standings than the less accurate title. 

To put things in context more, two related terms are “transcribing” and
“amanuensis”. Musicians often use the term transcribing for a process that produces
music notation, starting either with sound or with notation for a different
combination of instruments and voices; either process includes music setting as a subprocess.
Transcribing music (in the first sense of “transcribing”) by computer is discussed in
Sec. 3.2.2. An amanuensis is one who listens to a performance and writes it down in
music notation, as in Jef Raskin's “Using the Computer as a Musician’s Amanuensis”
[RASK80].

For definitions of other terms, see the Index to Definitions. 

In summary, this dissertation is concerned primarily with music setting. I
compare terminology in several other fields to that used in music in Sec. 5.3.
1.3. PROBLEMS AND GOALS

Music notation is like the written notation of virtually every natural language in 
that it was not designed by any specific person or group of people; rather, it evolved 
over a period of centuries. In the case of CMN, this evolution took place mostly in 
Europe between about 1000 and 1600 A.D., although a small amount of further
change has occurred continuously since then. Although CMN originated in a manner 
similar to natural language notations and has many structural similarities to them, it 
is quite different in not being based on any fixed character set, no matter how large, 
and in relying much more heavily on symbol position to convey information. This is 
an indication of the enormous complexity of CMN[3], and is a reason why a
music-notation formatting program is fundamentally different from, and much more
difficult than, a text-formatting program. In a word, CMN is essentially graphical;
written natural language is not. 

Thus, a music notation system has immediate interest from a computer science 
standpoint because it attacks a highly complex graphics problem — in fact, one of a 
complexity that has never been handled very satisfactorily by computer alone, that 
is, without a substantial amount of manual intervention.[4] This is certainly true,
because much music exists whose correct formatting requires considerable intelligence
(well beyond the state of the art of artificial intelligence), and some music exists
whose correct formatting probably requires full human intelligence. These are strong 
and obviously unprovable assertions, but I will give some arguments and many exam- 
ples to support them (in Chapter 2). 

An interesting graphics problem that occurs in music notation and that has 
definite artificial intelligence implications is that of placing symbols to avoid colli- 
sions. How can it be insured that slurs (for example) do not run into noteheads? 
The problem is greatly complicated by the fact that some collisions are perfectly 
acceptable; for example, published music frequently has slurs intersecting barlines or 
accidentals (see Sec. 2.5). Even if one can detect objectionable collisions, what should
be done when one is found? What symbol or symbols should be moved, and where
to? Similar situations can occur in other graphics domains, for example in
cartography (labelling map features) and electronic design (positioning components and
paths). Substantial work has been done on this problem in both domains; however,
little if any is general enough to apply to music notation. This is not surprising
because any truly general solution to this problem would really be a solution to what
Hofstadter has called [HOFS83a] “the slippability problem” of artificial intelligence.
I discuss collision avoidance in Sec. 5.2.1.
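
The detection half of this problem can be illustrated with a minimal sketch. It assumes, purely for illustration, that every symbol is reduced to an axis-aligned bounding box and that acceptability depends only on the pair of symbol kinds involved. The names Symbol, ACCEPTABLE, and objectionable_collisions are hypothetical and are not part of SMUT, and the sketch says nothing about the harder question of which symbol to move, and where to.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Symbol:
    kind: str      # e.g. "slur", "notehead", "barline", "accidental"
    left: float
    bottom: float
    right: float
    top: float


# Assumption: whether a collision is tolerable depends only on the two kinds.
ACCEPTABLE = {
    frozenset({"slur", "barline"}),
    frozenset({"slur", "accidental"}),
}


def overlaps(a: Symbol, b: Symbol) -> bool:
    """True if the two bounding boxes share any interior area."""
    return (a.left < b.right and b.left < a.right
            and a.bottom < b.top and b.bottom < a.top)


def objectionable_collisions(symbols):
    """Return the pairs that overlap and are not on the acceptable list."""
    return [
        (a, b)
        for a, b in combinations(symbols, 2)
        if overlaps(a, b) and frozenset({a.kind, b.kind}) not in ACCEPTABLE
    ]


slur = Symbol("slur", 0.0, 4.0, 10.0, 6.0)
barline = Symbol("barline", 5.0, 0.0, 5.2, 8.0)
notehead = Symbol("notehead", 8.0, 5.0, 9.2, 6.0)
found = objectionable_collisions([slur, barline, notehead])
```

Here the slur crosses both the barline and the notehead, but only the slur/notehead pair is reported. A real system would need far more context than a table of kinds to decide what is acceptable.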

Thus, one of the main goals of this work is to find usable approximate solutions 
to problems in graphics whose complete solution is certainly many years, probably 
decades, off. Additional goals are: 

(1) To gather explicit knowledge about one of the most sophisticated notational
systems in existence, namely CMN. There are of course many books on music
notation, but all, or nearly all, were written by musicians for musicians, and the
natural result is that they make enormous assumptions.[5] A related problem
with all of the books I am familiar with is that they sometimes give incorrect
rules in conjunction with correct examples, apparently as a result of generalizing
improperly. We will see an example of this in Chapter 4.

(2) To investigate program portability and user interface problems in a graphics 
program of significant complexity. 

The analogy between written language and CMN made above is an extremely
useful one, and I will return to it repeatedly throughout this dissertation. Natural
language formatting is a process most computer scientists are at least somewhat fam- 
iliar with; an enormous amount of study has been devoted to it; and nearly all of the 
problems it involves occur in music formatting (along with numerous others). Furth- 
ermore, this analogy has barely been hinted at in the existing literature on music set- 
ting, either by hand or by computer. I will particularly rely on two well-known 
text-formatting systems for examples: Knuth’s TeX, primarily because its underlying
philosophy is exceptionally general and so it provides far more interesting ideas than
do most text formatters, and the UNIX TROFF system, simply because this
dissertation was formatted by it and I am more familiar with it than any other system.

A useful discussion of many aspects of computer setting of music, including the 
history of music printing and several purely graphical problems, appears in David 
Gomberg's dissertation [GOMB75]. Gomberg also does some philosophizing on the
relative merits of batch and interactive approaches, from which I quote in Sec. 3.4. 
1.3.1. This Work Related to Artificial Intelligence

The relationship between this work and artificial intelligence (AI) is worth
commenting on. A major point of this dissertation, made especially in Sec. 2.5, is
that programming “Fully Automatic High-Quality Music Notation” (henceforth,
“FAHQMN”) is a research problem in AI, in fact one that is well beyond the
state of the art; for precisely this reason my program, SMUT, is not an AI
program. I wanted a program of practical value and felt that I could write one only
by eschewing AI techniques and not attempting to accomplish true FAHQMN.
Let us consider what the possible ways are of approximating FAHQMN. 

The wording of the phrase “Fully Automatic High-Quality Music Notation”
suggests three possible types of compromise: 

(1) Not being Fully Automatic, i.e., the user must tell the system more than the 
bare minimum that one would need to tell an expert human music engraver. 
For example, an experienced engraver can decide correctly in almost all cases 
what notes to beam together (see Sec. 2.3.3.2.2) and where to position the 
beams; however, doing so sometimes requires (as I have already suggested) 
considerable understanding of music, so no program in existence can do as 
well as a human expert. In fact, some systems never beam notes together at 
all unless they are explicitly told where each beam should begin and end. A 
variation on this is Xerox PARC's Mockingbird (Sec. 3.4), which generates a 
non-standard time notation that does not use beams at all; however, it pro- 
vides powerful interactive tools for converting this notation to CMN. The 
level of automation of a system is, admittedly, a rather fuzzy concept: How
automatic is a system that decides automatically what notes to beam 
together and where to position the beams; that makes frequent and elemen- 
tary mistakes; and that allows overriding its decisions? What if it provides 
no method for overriding its decisions? What if it makes only occasional, 
relatively sophisticated mistakes? This question is also related to the 
batch/interactive dichotomy. One might argue that a batch system that 
allows overriding of its decisions is de facto more automatic than an other- 
wise identical interactive system that allows overriding its decisions, since 
the manual override will be far less convenient and hence far less often used 
in the batch case. 
(2) Not being High Quality. Some aspects of high quality in music notation are
very straightforward, e.g., the shapes of “fixed” symbols such as clefs and
accidentals. Others are less obvious and interact with point 1, Full Automa- 
tion. For example, a system might — left to itself — draw a slur that would 
be fine in another context but that intersects note stems unnecessarily. If 
this system is not fully automatic and allows changing the shape and/or 
position of the slur, it is still capable of high quality. 

(3) Not covering all of Music Notation, i.e., handling only a subset of music 
notation. In this thesis I consider only CMN, which — extensive though it 
is — already excludes a great amount of music notation in the broadest 
sense. Also, as we will see in Secs. 2.3.6 and 5.2.3, most of the really hard 
problems of music notation are exacerbated by complexity of “texture”, that 
is, by the presence of multiple independent voices on a staff (see Sec. 2.3.6
for a definition of “texture” and Secs. 2.3.1 and 2.3.2 for definitions of
“voice” and “staff”). As a result, few systems can handle more than two 
independent voices on a staff. Again, interactions among the compromises 
exist: no system I know of handles more than two independent voices per 
staff automatically. 

In my opinion, every existing CMN setting system, including mine,
compromises on not one but at least two of the three aspects — in most cases with the
author’s conscious intent, in some without. As I have already suggested, most sys- 
tems compromise the most on point 1, Full Automation, while my system 
compromises primarily on point 3. See also Sec. 5.2.1.4.

1.4. METHODOLOGY AND SCOPE 

The main methodology of the research was to write a program: 

(1) Capable in design of handling virtually all Western music (art or otherwise) 
written from about 1600 to 1945, and in implementation of handling, in art 
music of about 1700 to 1935, nearly all wind music, all string music that does
not involve “multiple stops” (two or more simultaneous notes), texturally simple
keyboard music (e.g. Bach Two- and Three-Part Inventions, some of Bartók’s
Mikrokosmos but almost no Chopin), and many orchestral scores (limited in
number of voices only by memory and plotter capability). More specifically, the
implementation is able to handle one or two rhythmically independent lines on a
staff with each line having only one note at a time, and any number of staves
(again limited by computer memory). (Much string music has occasional chords
of two to four notes. Of course, most piano music has chords of up to five
notes. The Bach Chaconne for unaccompanied violin has, in places, three
rhythmically independent lines; piano music such as Chopin’s sometimes has
this feature and often has lines moving from one staff to another.)

(2) Able to produce publishable and nearly “publication” (i.e., typical engraved
music) quality.

(3) Portable from computer to computer, including minicomputers and large micro- 
computers, by virtue of an appropriate programming language and style. For 
several reasons including that of paragraph 6 below, I used FORTRAN IV.

(4) Portable to any reasonable graphic output device, either raster or random scan. 
This was accomplished (though at considerable cost in efficiency and
programming effort) by assuming a minimal set of primitives, specifically those which:
draw a vector, write a string of characters in an undefined font, and convert a 
number to a string of characters and write it. 

(5) Efficient (in spite of paragraphs 3 and 4) and reliable enough for practical use. 

(6) Integrated with the existing Indiana University Computer Music System
[BYRD77a] to facilitate practical use. 
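
The device independence described in item (4) can be sketched as a small abstract interface. This is a Python illustration only, not the FORTRAN IV code of SMUT; the class, method, and function names (OutputDevice, draw_vector, write_string, write_number, RecordingDevice, draw_staff) are hypothetical, and building the number primitive on top of the string primitive is an assumption of the sketch. The point is just that everything above the primitives, such as a five-line staff, can be drawn without knowing what device lies underneath.

```python
from abc import ABC, abstractmethod


class OutputDevice(ABC):
    """Hypothetical minimal primitive set for any plotter or display."""

    @abstractmethod
    def draw_vector(self, x0, y0, x1, y1):
        """Draw a straight line from (x0, y0) to (x1, y1)."""

    @abstractmethod
    def write_string(self, x, y, text):
        """Write text at (x, y) in whatever font the device provides."""

    def write_number(self, x, y, value):
        # Convert a number to characters and write it (assumed to be
        # expressible through write_string in this sketch).
        self.write_string(x, y, str(value))


class RecordingDevice(OutputDevice):
    """Stand-in device that records calls instead of plotting."""

    def __init__(self):
        self.ops = []

    def draw_vector(self, x0, y0, x1, y1):
        self.ops.append(("vector", x0, y0, x1, y1))

    def write_string(self, x, y, text):
        self.ops.append(("string", x, y, text))


def draw_staff(device, x, y, width, line_gap=1.0):
    """Draw five horizontal staff lines using only draw_vector."""
    for i in range(5):
        level = y + i * line_gap
        device.draw_vector(x, level, x + width, level)


device = RecordingDevice()
draw_staff(device, 0.0, 0.0, 100.0)
device.write_number(0.0, 6.0, 12)   # e.g. a measure number above the staff
```

Porting to a new output device then means writing one new subclass. The cost, as item (4) notes, is that curves and music symbols must all be decomposed into vectors.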

This program, named SMUT (System for MUsic Transcription), is batch-oriented, 
not interactive. It is clear to me that a really useful music notation system needs to 
be interactive; however, it also needs all of the capabilities of a batch system, so that
I feel essentially none of my work has been wasted. This relates to the question of
"levels of control" in a system's user interface, which I feel has been neglected by
most system designers; see Secs. 5.2.3 and 5.3.1.

1.5. OVERVIEW

The remainder of the dissertation is organized as follows. 

Chapter 2, "Notation Systems and Music Notation", has three main sections. 
The first, "Conventional Music Notation for the Computer Scientist", is primarily 
intended to give the computer scientist who knows little or nothing about CMN 
enough background to understand the remainder of this and following chapters. It 
should also be of interest to persons looking for a description of music notation that
is much more systematic than the usual ones, though it is still far from formal in the
mathematical sense. The other two sections are "A Comparison of Notation
Systems" and "Counterexamples". These are intended to back up my assertions about
the complexity of CMN and the intelligence required to produce it correctly. In a
sense, these sections provide the justification for this dissertation in that they show 
how difficult on several different levels music setting is. 

Chapter 3 is "Background and Related Work". It discusses the three parts of the 
entire CMN problem (what to print, where to print it, and printing it) and 
approaches to CMN output, and summarizes the work of several researchers on 
automating music notation from the 1950's to the present. It also gives some
background for my own work. One of the other aspects of "the musicians' problem"
mentioned at the beginning of this chapter is important to CMN output systems, 
namely CMN input systems; these are discussed in this chapter at some length. 

Chapter 4 describes "How SMUT Works". In order to give the reader some intui- 
tion about the problem, it begins by pointing out problems in setting several sample 
excerpts of music of increasing complexity. It then discusses the basic "Principles of 
Operation" of SMUT's four passes, then goes into detail about some of its more
interesting features, especially those relating to rhythm. The chapter closes with 
further discussion of the setting of one of the sample excerpts. 

Finally, Chapter 5 is "Conclusions". Although the combined efforts of many 
workers have led to considerable progress towards a really useful automatic notation 
system, music itself (and, therefore, music notation) is much too complex to yield 
fully to a mere 25 years of work. As I have already pointed out, many of the prob- 
lems in CMN setting also occur in other domains, so one might instead go back 35 
years to the first electronic computers — still not long enough to solve problems of 
such magnitude. In this chapter I discuss the work that remains to be done. For
example, I say quite a bit about the collision-avoidance problem mentioned above,
which is very important in several domains. I comment on the difficulty of handling
complex textures and its relationship to collision avoidance. I also discuss "Editing
and Formatting, Design and Drafting" and the importance and problems of systems 
that support multiple levels of control. 
NOTES

[1] An entire book about computers and music was published as early as 1959: [HILL59].

[2] To my knowledge one previous dissertation in computer science has been devoted to this
subject, namely [GOMB75]. Unlike me, Gomberg did not actually implement a music formatting
system, although he did analyze many of the problems. There have also been several
dissertations and master’s theses in both music and computer science devoted to more or less closely
related issues, e.g., [PRER71], [KNOW71], [MOOR75], [REND81].

[3] As further evidence, a standard text, [READ69], is 482 pages in length. See also Chapter 2,
especially Sections 2.3.6, 2.4, and 2.5.

[4] Several systems (e.g. Armando dal Molin’s "Music Reprographics" system [DALM75,
DALM78], Leland Smith's MSS [SMIT73, SMIT78]) do quite well either by relying on a human to
interactively "guide" the computer, or by requiring manual finishing of the printed music after it
has been computer drawn.

[5] The standard texts are [READ69], [ROSS70], and [STON80]. The closest thing I know of to
an explicit statement of the rules of music notation is in [ROSS70]. This is still no more than, to
choose a number, 5% explicit (as compared to the usual value of, say, 2%).
Notation Systems and Music Notation


2.1. INTRODUCTION 

Of the many forms of music notation in use around the world, my work is
concerned exclusively with the one most familiar to Westerners: the notation developed
originally for Western music of the last few hundred years and now in use for many 
types of music, especially art music. I will call this notation “conventional music
notation”, or “CMN” for short.

Like the written notation of virtually every natural language, CMN was not 
designed by any specific person or group of people; rather, it evolved over a period of 
centuries. In the case of CMN, this evolution took place mostly in Europe between 
about 1000 and 1600 A.D., although its roots go back far earlier and a small amount 
of change has occurred continuously since 1600. 

Understanding any notation requires an understanding of the information the 
notation attempts to capture. In the case of CMN, four types of information are 
involved, namely the four parameters of musical sound (at least according to Western 
music theory): pitch, time, loudness, and timbre.[1] The basic principles of the
representation are simple. CMN uses a highly modified two-dimensional Cartesian
coordinate system to represent the four parameters: 

(1) Pitch is represented explicitly along the vertical axis, although the
representation is nonlinear and complex. It relies heavily on the staff, a set of five
horizontal lines for reference (see Sec. 2.3.2).

(2) Time is represented fairly explicitly along the horizontal axis; this representation
is also nonlinear and even more complex. (“Duration” and, especially,
“rhythm” are higher-level temporal aspects of music; more will be said about
them below. Furthermore, “time” is often used by musicians as a synonym for
“meter”, but I use it here in its more colloquial sense.)
(3) Loudness (closely related to “dynamics”) is mostly implicit. When it is given
explicitly, it is by means of verbal comments, usually in Italian and often
abbreviated, plus a very few symbols.

(4) Timbre (identical to “tone quality”) is represented in two distinct ways, one
mostly explicit, the other largely implicit. The explicit method is used in
compositions for several players to specify which notes belong to which player and
instrument; this constitutes a partial specification of timbre. This is
accomplished (to oversimplify slightly) by having several carefully aligned staves and
using the appropriate one for each sound. Selection of a specific timbre from 
the (usually ill-defined) set available on the specific instrument is mostly 
governed by unwritten convention. When it is written, it is expressed in very 
much the same way as control of loudness. 

To restate the situation, the horizontal axis represents time. A very short
vertical axis represents pitch; it is repeated several times, once (more-or-less) for each
possible value of “player” in the piece in question. Thus, as Spiegel has pointed out 
[SPIE83a], three dimensions are squeezed into two in a way that is often used with 
graphs, by having one two-dimensional graph of variable 1 against variable 2 for each
value of variable 3. Additional loudness and timbre information is given verbally, 
and still more is implicit. 
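
The squeezing of three dimensions into two can be made concrete with a toy coordinate mapping. The sketch below is deliberately oversimplified: it assumes that time maps linearly to the horizontal coordinate and that pitch is already given as a position on the staff, whereas both representations are nonlinear in real CMN, as noted in the list above. The function name and all constants are hypothetical.

```python
def note_position(player, staff_step, onset,
                  staff_gap=12.0, step_height=0.5, time_scale=4.0):
    """Map (player, pitch, time) to page coordinates (x, y).

    player     -- staff index, 0 = top staff of the system
    staff_step -- position on the staff, 0 = bottom line, 1 = first space, ...
    onset      -- start time in beats
    """
    x = onset * time_scale                  # one horizontal time axis, shared
    staff_origin = -player * staff_gap      # one short pitch axis per player
    y = staff_origin + staff_step * step_height
    return x, y


# The same staff position at the same moment, for two different players:
upper = note_position(player=0, staff_step=4, onset=2.0)
lower = note_position(player=1, staff_step=4, onset=2.0)
```

Both notes receive the same horizontal coordinate, which is what keeps simultaneous events vertically aligned across staves; only the vertical offset distinguishes the players.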

The above list gives the parameters in roughly decreasing order of importance in 
Western music theory. It is not surprising that this is also a decreasing ordering of
the explicitness with which the parameters are represented in music notation.[2] 

A major complicating factor in CMN is that most Western music consists of 
several synchronized lines going on simultaneously. This would be only a minor
complication if the lines were notationally independent and merely needed to be aligned,
as with a set of simultaneous equations; in fact, however, they can interact in rather
complex ways (cf. Secs. 2.3.6 and 2.4).

If CMN did not concern itself with communicating this rather large amount of
information in an efficient, i.e., “high-bandwidth”, way, its structure could still be
relatively simple. However, CMN is extremely concerned with high-bandwidth
communication, and in particular high-bandwidth communication of music written in a
particular style: it is optimized for music in approximately the style of Mozart or
Beethoven, i.e., typical Western art music of the last half of the 18th or first half of
the 19th century. Gomberg makes some perceptive comments on bandwidth in CMN
and its relationship to standardization [GOMB75, p. 7]:

Professional musicians usually cannot enumerate [the conventions of CMN], but they 
are well aware of them through experience. The basic reason that music setting 
must abide by these conventions is that music is (often) read in performance at a 
rate much faster than most persons read ordinary text, and any deviation from the 
conventional causes the eye to hesitate and thus affects the ability of the musician to 
perform . . . The situation is aggravated by the short rehearsal time available in
most . . . orchestras. Orchestra players do not attempt to memorize their parts, but
instead read it during a performance. Thus it is imperative that music is printed in
the manner expected by these musicians.

In short, bandwidth is vitally important in CMN because music is a performing art 
and because the notation is used in real time during performances. (Another 
interesting consequence of these facts about music is the importance of properly 
chosen page turns: see Sec. 2.3.6.4.) 

Although CMN originated in a manner similar to natural language notations, it is 
quite different in that it is not based on any fixed character set, no matter how large,
and in that it relies much more heavily on symbol position to convey information.
We will discuss this difference in detail in Sec. 2.4, “A Comparison of Notation
Systems”.

It may appear at first that CMN is a gigantic “kludge” — a formidable 
hodgepodge of poorly organized and confused ideas which is simultaneously excessive
in complexity and lacking in precision. Indeed, over the centuries dozens, if not
hundreds, of attempts to reform music notation have been made, some superficial, some
far-reaching.[3] I disagree strongly with the view that CMN is badly in need of
reform. Some of CMN is indeed overly complex, and some of it is insufficiently
precise. But for the style of music for which it evolved — if I can read purpose into the
process in retrospect — CMN is a remarkably successful system. Furthermore, CMN
has proven flexible enough to be adaptable in almost any direction. Rather than
complain that a system that works superbly for Beethoven does not work so well for
Xenakis, we should marvel that it works for Xenakis at all.[4], [5]

2.2. GENERAL DEFINITIONS
2.2.1. What Is Notation?

A standard unabridged dictionary, [MERR34], gives six definitions of “nota- 
tion”. The first three are not relevant to us. Definition 4 is 

act, process, or method of representing by a system or set of marks, signs, figures, 
or characters; any system of characters, symbols, or abbreviated expressions used 
in an art or science, to express technical facts, quantities, etc. 

Definition 5 is of mathematical notation, and 6 (by far the longest) of music
notation. More formally, Goodman [GOOD76] discusses notation from a logician’s
standpoint. It is beyond the scope of this dissertation to go into detail, but he
distinguishes between notational schemes, the criteria for which are purely
syntactic, and notational systems, which must satisfy semantic requirements as well.
He concludes (p. 183):

The main corpus of purely musical characters of the system [of conventional
music notation] . . . appears on the whole to meet the semantic as well as the
syntactic requirements for a notation. The same cannot be said for all the numerical
and alphabetical characters that also occur in scores.

In any case, Goodman admits that his definitions are far from ordinary usage, and 
for our purposes the informality of the dictionary definition will be quite satisfac- 
tory. Any references in this dissertation to notational systems should be taken 
informally unless specified otherwise. 

2.2.2. What Is Conventional Music Notation?

I have not really defined conventional music notation (CMN) yet. For the 
purposes of this dissertation, an informal definition will do: CMN includes any 
arrangement of the symbols in general use by composers in the European art 
music tradition from about 1700 to 1935, used with the meanings that were
standard: (1) if the notation was made between 1700 and 1935, at the time it was
made; (2) if the notation was made before 1700, with the meanings of 1700; or (3)
if the notation was made after 1935, with the meanings of 1935. Thus, I am
referring to a large set of conventions for semantics as well as syntax, although my
primary concern is with syntax. The endpoints 1700 and 1935 are somewhat
arbitrary.[6] As the above definition implies, and as I said in Sec. 2.1, music notation
has actually changed continuously for centuries, although relatively slowly over
the 1700-1935 period. Semantic changes within the period were more
substantial than syntactic ones, but still not very great.[7] Note also that I do not insist
that, to qualify, music must actually have been composed or notated between
1700 and 1935: most of the works of Elliott Carter, for example, are expressed in
CMN, even though they were not written until the 1940’s and after.

2.3. CONVENTIONAL MUSIC NOTATION FOR THE COMPUTER 
SCIENTIST 

This section is devoted to a series of explanations and tutorial examples that 
assume no previous knowledge of music notation. I hope thereby to make the rest of 
the dissertation understandable to the nonmusician, especially the computer scientist. 
My intent in this section, then, is to impart a “reading knowledge” of CMN. To 
that end, I will concentrate on the principles of CMN; I will also discuss many, but
nowhere near all, of the details of CMN. For example, the question of when note
stems should go up and when they should go down is not discussed here. I will,
however, point out a few “writing knowledge” problems to give a feel for the complexity
of CMN. Two good introductions to CMN for the layperson are [SEEG73] and
[SHAN57]. For more details from a musician’s perspective, see the standard texts:
[READ69], [ROSS70], and [STON80]. Three more specialized books, [DONA63],
[GOMB75], and [READ78], are also useful. Finally, [RAST82] is a good historical
study of Western music notation. Of all these, [ROSS70] is the only one that gives
any significant number of explicit “writing knowledge” details, although it still omits
far more than it includes. What [ROSS70] describes is also the closest to CMN;
[READ69] and [STON80] include a great deal of new music notation that is not
standard — not yet, at any rate. Chapter 4, “How SMUT Works”, gives more
explicit information than any of these sources on certain topics, for example rhythm
notation and horizontal positioning of symbols (often called “justification”, or, more
accurately, “punctuation”).

2.3.1. Definitions

Before going on, we need some additional terminology. Much more will be
introduced as we go.

As I have said, most Western music consists of several synchronized lines
going on simultaneously; these lines are technically called voices, regardless of
whether they are performed by singers or instruments. Note that a multivoice 
composition may be played by one person, e.g., on the piano. The concept of 
“voice” is subtle and really needs more attention; we will return to it in Sec. 
2.3.6. 

In music in any of the styles that are notated with CMN, each voice consists 
of a series of tones, usually having either constant or (for percussion instruments) 
indefinite pitch, and intervals of silence. The symbols for the tones are called 
notes; for the intervals of silence, rests.[8]

A score is the music notation for a composition to be played simultaneously by
one or more performers, giving the music to be played by all performers, with
vertical alignment indicating the temporal relationships among notes and rests in
different voices. If there is more than one performer, the performers do not
ordinarily play from the score; instead each plays (or sings) from a part, showing only
what she or he is to do.[9] In such a case, the score is for the use of conductors
and persons studying the music. 

A system is a set of two or more staves (see Sec. 2.3.2 for the definition of
“staves”) connected by a vertical line or brace at the left, describing the music to 
be performed simultaneously by one or more performers. In a score, every page is 
made up of one or more systems. Parts can also include systems, since some 
instruments, for example the harp and the piano, are written on more than one 
staff. 

A movement is either a complete musical composition or the largest subdivi- 
sion of a musical composition. Movements are nearly always separated from sur- 
rounding movements by major changes in the characteristics of the music and are 
usually separated by silence as well. 

2.3.2. Pitch Notation 

Fig. 1 shows the piano keyboard (in relation to two staves with clefs, which we 
will discuss shortly). It is well known that musical pitches are described by the 
letters A through G. These letters correspond to the white keys on the keyboard 
in a pitch range of one octave, i.e., a frequency range of 2:1 (under ordinary
circumstances, pitch is logarithmically related to frequency); then they repeat.[10]
Thus, the piano keyboard covers slightly over seven octaves. It has eight notes
called “C”, one of which is called “middle C” (circled in the figure). The eight
C’s are in the frequency ratio (from left to right on the keyboard)
1:2:4:8:16:32:64:128 and have actual frequencies from about 32.7 to 4185 Hertz
(cycles per second). There are many systems for combining letter names with
octave designations; by far the most sensible is that used by the Acoustical
Society of America. In this system the octave of the keyboard starting with the
lowest C is labelled 1. Octaves above (to the right) are labelled with increasing
integers, octaves below (to the left) are labelled with decreasing integers — though
numbers below 0 are hardly ever needed! In this system, then, middle C is “C4”.
So far we have labelled all of the white keys of the keyboard, but none of the
black keys. We’ll fill this gap shortly.

Figure 1. Piano keyboard and clefs
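The octave numbering and the 2:1 frequency relationship described above can be checked with a short sketch. It assumes twelve-tone equal temperament with A4 tuned to 440 Hz, a tuning reference that the text itself does not state; the name `c_frequency` is hypothetical and introduced only for illustration.

```python
# Frequencies of the eight C's on the piano.
# ASSUMPTION: equal temperament with A4 = 440 Hz (not stated in the text).

A4_HZ = 440.0

def c_frequency(octave: int) -> float:
    """Frequency in Hz of the C that begins the given octave (middle C is C4)."""
    # C4 lies 9 semitones below A4; each octave spans 12 semitones.
    semitones_from_a4 = (octave - 4) * 12 - 9
    return A4_HZ * 2.0 ** (semitones_from_a4 / 12)

cs = [c_frequency(octave) for octave in range(1, 9)]   # C1 through C8
ratios = [round(f / cs[0]) for f in cs]                # relative to the lowest C

print([round(f, 1) for f in cs])
print(ratios)
```

Under this assumed tuning the lowest C comes out near 32.7 Hz and each successive C doubles the previous one, which is the ratio sequence quoted in the text.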

Fig. 2 shows a musical staff (plural staves), consisting of five parallel lines;
this particular staff has a treble clef at its left end. As I have said, vertical
position in CMN indicates pitch; however, the correspondence is not fixed, but
depends on the clef. Notes are restricted to vertical positions centered on staff
lines or on the spaces in between; each of these positions corresponds to the
letter name of a different pitch. The letter names for treble clef are given in the
figure. Note that E and F both occur twice, but in two different octaves, namely
octaves 4 and 5. The other clefs are shown in Fig. 3. With bass clef instead of
treble, the bottom line of the staff would be G2 instead of E4. The
correspondence between position and pitch for treble and bass clefs is shown in Fig. 1. The
less common alto and tenor clefs respectively make the bottom line of the staff F3
and D3. Rigid conventions determine which clef or clefs can be used for a given
instrument or voice. Only treble and bass clefs are ever used for the piano, for
example.[11]
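The dependence of pitch on clef can be made concrete with a small sketch. The bottom-line pitches are the ones given in the text (treble E4, bass G2, alto F3, tenor D3); the position numbering (0 for the bottom line, 1 for the space above it, and so on) and the function name `staff_pitch` are assumptions made only for this illustration.

```python
# Pitch name at each staff position, given the clef.
# Bottom-line pitches come from the text; the position numbering is an assumption.

LETTERS = "CDEFGAB"   # octave numbers change at C

BOTTOM_LINE = {
    "treble": ("E", 4),
    "bass": ("G", 2),
    "alto": ("F", 3),
    "tenor": ("D", 3),
}

def staff_pitch(clef: str, position: int) -> str:
    """Name the pitch at a staff position.

    Position 0 is the bottom line, 1 the space above it, 2 the next line,
    up to 8 for the top line. Values outside 0..8 lie off the staff.
    """
    letter, octave = BOTTOM_LINE[clef]
    # Count letter-name steps from C0 so the octave falls out of a divmod.
    steps = octave * 7 + LETTERS.index(letter) + position
    new_octave, index = divmod(steps, 7)
    return f"{LETTERS[index]}{new_octave}"

treble = [staff_pitch("treble", p) for p in range(9)]
print(treble)
print(staff_pitch("bass", 10), staff_pitch("treble", -2))
```

In this numbering, the treble staff runs from E4 to F5, so E and F each appear in octaves 4 and 5, and middle C falls one step pair above the bass staff and below the treble staff.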

In Fig. 4 a note is added to the staff, specifically a whole note (this term is
defined in Section 2.3.3.1) with pitch E5. In Fig. 5 a flat is added, which lowers
the pitch to E-flat 5. Now we are ready for the black keys. Fig. 6 again shows
the piano keyboard, this time only one octave but with all keys labelled. Clearly,
counting both black and white keys, there are 12 notes in an octave; the 13th
note repeats the initial one an octave higher. These are equally spaced on a
logarithmic scale of pitch[12], and the distance between any two consecutive
notes is called a semitone. However — to oversimplify considerably — at critical
points in the history of Western art music, only seven of the 12 were used at a
time to make up the basic pitch material of a composition, with the other five
being used for embellishment; many aspects of pitch notation reflect this
historical inequality of importance. One such reflection is, of course, the arrangement of
the piano keyboard into black and white keys, and others are the facts that only
the white keys have independent names (the letters A through G) and positions
on the staff. The five black key pitches may be obtained by accidentals —
semi-local pitch modifiers — two of which are the flat (down one semitone) and the
sharp (up one semitone). (Fig. 7 shows the five accidentals: double flat, flat,
natural, sharp, and double sharp. Double flat and double sharp have the effect
one would guess, namely two semitones in the appropriate direction each; they
are used only under special circumstances. I will discuss the natural shortly.)
Since there is no black key between B and C, B-sharp is the same as C, and C-flat
is the same as B.[13] Likewise, E-sharp is the same as F, and F-flat is the same as
E. I call accidentals “semi-local” because they affect, not just the following note,
but all notes with the same letter name and octave until the end of the measure
(defined in Sec. 2.3.3.2).[14]

Figure 2. Staff and treble clef

Figure 3. Other clefs
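The semitone arithmetic of accidentals can be sketched as follows. The semitone offsets of the white keys follow from the keyboard layout described above; the names `key_number`, `WHITE_KEY_SEMITONE`, and `ACCIDENTAL_SHIFT` are hypothetical, and counting from C0 is simply a convenient assumption.

```python
# Semitone arithmetic for letter names plus accidentals.
# Equal results mean the two spellings land on the same piano key.

# Semitones above C for each white key within one octave.
WHITE_KEY_SEMITONE = {"C": 0, "D": 2, "E": 4, "F": 5, "G": 7, "A": 9, "B": 11}

# Shift in semitones applied by each of the five accidentals.
ACCIDENTAL_SHIFT = {
    "double flat": -2,
    "flat": -1,
    "natural": 0,
    "sharp": 1,
    "double sharp": 2,
}

def key_number(letter: str, octave: int, accidental: str = "natural") -> int:
    """Semitones above C0 for a letter name, octave, and accidental."""
    return 12 * octave + WHITE_KEY_SEMITONE[letter] + ACCIDENTAL_SHIFT[accidental]

print(key_number("B", 3, "sharp") == key_number("C", 4))   # B-sharp is C
print(key_number("C", 4, "flat") == key_number("B", 3))    # C-flat is B
print(key_number("E", 4, "sharp") == key_number("F", 4))   # E-sharp is F
```

Note that the octave number belongs to the letter name, so B-sharp in octave 3 coincides with C in octave 4.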

In Fig. 8 the flat accidental has been removed and a key signature containing
two flats has been added. The key signature is a global pitch modifier, global in
two senses. One is its scope: it applies, not merely for the rest of the measure,
but for the rest of the movement or until explicitly changed. The other is vertical
applicability: if B-flat and E-flat appear in the key signature (as here), B’s and E’s
in all octaves are flatted, not just in the octaves where the flat signs appear.
How, then, is it possible to write unflatted E if E-flat appears in the key
signature? By putting another accidental, a natural, in front of the note (Fig. 9). To
restate our accidental-scope rule, any explicit accidental — double flat, flat,
natural, sharp, or double sharp — overrides the key signature or previous
accidentals, with a scope of the rest of the measure.
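The two scope rules just stated (key signature: all octaves, until changed; explicit accidental: same letter name and octave, until the end of the measure) can be sketched in a few lines. The data layout and the name `resolve_accidentals` are assumptions for illustration only, and refinements such as notes tied across barlines are deliberately ignored.

```python
# Resolve the accidental in force for each note under the scope rules above.
# Each note is (letter, octave, explicit_accidental_or_None).
# ASSUMPTION: a simplified model; ties across barlines are not handled.

def resolve_accidentals(measures, key_signature):
    """Return, measure by measure, the accidental actually in force per note.

    key_signature maps a letter name to an accidental and applies in all
    octaves. An explicit accidental applies to its letter name and octave
    for the rest of the measure only.
    """
    resolved = []
    for measure in measures:
        in_force = {}   # (letter, octave) -> accidental; reset at every barline
        out = []
        for letter, octave, explicit in measure:
            if explicit is not None:
                in_force[(letter, octave)] = explicit
            default = key_signature.get(letter, "natural")
            out.append((letter, octave, in_force.get((letter, octave), default)))
        resolved.append(out)
    return resolved

two_flats = {"B": "flat", "E": "flat"}
measures = [
    [("E", 5, None), ("E", 5, "natural"), ("E", 5, None), ("E", 4, None)],
    [("E", 5, None)],
]
result = resolve_accidentals(measures, two_flats)
for measure in result:
    print(measure)
```

In the first measure the natural cancels the key signature for the remaining E5s but not for E4; in the second measure the key signature is back in force.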

We now know enough notation to be able to notate most pitches in the middle 
of the keyboard, namely those that happen to fall on the staff in whatever clefs 
the instrument allows — for the piano, treble and bass. Unfortunately, most of
the keyboard does not satisfy this condition. We can’t even notate middle C
(C4), which (as Fig. 1 shows) is above the staff in bass clef and below it in treble
clef! Two devices exist to solve this problem: ledger lines and octave signs.
Ledger lines (Fig. 10) are simply temporary extensions of the staff upward or
downward. A very convenient way to extend the staff’s pitch range by another
octave and a half is with four or five ledger lines (more or less). With more than
this, it is too easy to misread the number of ledger lines, and something else has
to be done. That something is the octave sign (Fig. 11), which temporarily moves
all pitches up or down an octave: up if the octave sign is above the staff, down if
it is below.[15] The notes in Fig. 11 have the same pitches as those in Fig. 10,
namely D3 and A8. As the figure shows, ledger lines and octave signs can be used
together.

2.3.3. Time Notation

2.3.3.1. Notes and Rests Alone: Duration

As I have said, time is represented in CMN on the horizontal axis: the 
music is read from left to right (except, of course, across line breaks). 

Of all the symbols in music notation — clefs, key signatures, notes, rests, 
barlines, etc. — the only ones that actually occupy time in performance are 
notes and rests. In a given musical voice, notes and rests are performed
“end-to-end”, each one beginning at the moment the previous one ends. The
duration of a note or rest is just the amount of time it occupies. (The term rhythm
refers to a much more complex and higher-level concept; we will postpone
discussing it for a while.) The common basic durational symbols for both notes
and rests are shown in Fig. 12a, and some of the relationships among them are
shown in Fig. 12b. A sixteenth-note is shown with its parts labelled in Fig.
13.[16] It can be seen that the names of the durations are mostly ordinals of
powers of 2. The basis of time notation is the idea that the duration of one
specific symbol is specified (sometimes very implicitly, not to mention vaguely);
all other symbol durations are relative to that one, in an extended binary
system. For example, a piece might be marked “M. M. ♩ = 80”, an explicit
statement that there are to be 80 quarter-notes per minute. (This marking is
called a tempo mark. Tempo marks set a global variable relating to time; they
are simply positioned horizontally at the point where they are to begin taking
effect. Other tempo marks — “accelerando”, “ritardando”, etc. — describe
gradual changes of tempo, so that, on a graph of tempo against time,
more-or-less general line segment functions are available.) Naturally, quarter-notes
last half as long as half-notes and twice as long as eighth-notes, four times as
long as 16th-notes, etc. There is one exception to this scheme: the whole rest
has the duration of a whole measure, which may or may not equal a
whole-note or two half-rests. (See below for the definition of measure.)

Figure 12a. Symbols for note and rest durations
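As a worked example of the relative-duration scheme, the following sketch converts the basic note values to seconds under the tempo mark used in the text, 80 quarter-notes per minute. The function name `seconds` and the dictionary of note names are hypothetical; the whole-rest exception mentioned above is not modelled.

```python
from fractions import Fraction

# Durations in seconds of the basic note values under a tempo mark of
# 80 quarter-notes per minute. Note values are fractions of a whole note.

def seconds(note_value: Fraction, quarters_per_minute: int) -> Fraction:
    """Duration in seconds of a note lasting `note_value` whole notes."""
    quarter_seconds = Fraction(60, quarters_per_minute)
    return note_value / Fraction(1, 4) * quarter_seconds

DENOMINATORS = {"whole": 1, "half": 2, "quarter": 4, "eighth": 8, "16th": 16}
table = {name: seconds(Fraction(1, d), 80) for name, d in DENOMINATORS.items()}

for name, secs in table.items():
    print(f"{name:8s} {float(secs):.4f} s")
```

Once the quarter-note is fixed at three quarters of a second, every other value follows by repeated halving or doubling.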

Clearly the list of durations that can be notated in this way has intolerably 
large gaps. What if “third-notes” are wanted? These gaps are filled in in 
several ways. 

(1) The most general extension is the tie, a curved line that combines two
notes of the same pitch into one continuously-sounding note (Fig. 14).
Ties can be used consecutively, so that three or more notes are combined
into one.[17]

Ties are very general, but alone are not sufficient to satisfy the needs of real 
music. There are two specific situations that are still not adequately taken 
care of; each has a corresponding notational device. 

(2) The more obvious situation is the occurrence of durations whose ratios to 
any basic duration cannot be expressed in binary with any finite number 
of digits, and which would therefore require an infinite number of tied 
notes. The “third-note” is an example. This is handled by a device that, 
unfortunately, has at least six names. We will call it the groupet; it is
also known as the artificial, irregular, or extrametric group, and (less
sensibly) as the unequal rhythmic group or irrational rhythmic subdivision.
By far the most common example is the triplet (Fig. 15), whose three
notes[18] are each reduced to 2/3 of their normal durations. (The little
“3” is called an “accessory numeral”[19].) Unfortunately, as in so many
aspects of CMN, as soon as one leaves the most common situations, the
rules become both complex and vague. The question here is what
durational values should be used within the groupet. Consider, for example,
two ways of notating a septuplet with a total duration of a half-note
(Fig. 16). The first way follows the triplet example by using within the
groupet eighth-notes whose durations are greatly reduced: 7/8 >> 1/2.
(The rule here is to “retain the note-values of a regular note-division for
the faster, irregular pattern, until the next, shorter regular division, and
thus the next shorter note-value has been reached.” [STON80, p. 130]
Under this rule, groupets always reduce durations.) The second way uses
within the groupet sixteenth-notes whose durations are slightly increased:
7/16 < 1/2. The (relatively few) proponents of the second method argue
that the inconsistency of having groupets sometimes shrink durations and
sometimes stretch them is more than compensated for by the advantage
of using notes whose effective durations are much closer to the normal
ones. The situation is further complicated by some musicians’ using one
method in binary rhythmic division levels (see Sec. 2.3.3.2) and the other
in ternary division levels. All one can say in general is that notes and
rests within groupets always have more than half their normal durations,
and nearly always less than their full normal durations. Fortunately, it is
almost always possible to tell from context what the composer
intended.[20] Nested groupets are also possible, but they are relatively
rare.

(3) The other problem situation is purely a matter of readability: notes with
durations of one-and-a-half times a basic duration are extremely
common, and without some additional notational machinery, music would be
crowded with thousands of ties. The augmentation dot (Fig. 17) allows a
single note to represent not only these durations, but a superset of them,
namely those of the form $\sum_{i=0}^{n} d \times 2^{-i}$, where d is one of the basic durations.
Specifically, the above formula describes a note with n augmentation
dots; n = 0 or 1 is common, 2 fairly rare, 3 or 4 very rare.[21] The vertical
position of the dot depends on whether its note is centered on a space or
a line. Dots belonging to “space” notes always appear directly to the
right of the notehead; dots belonging to “line” notes appear in the
middle of the next space up if there is only one voice on the staff. If the staff
is shared with other voices, dots belonging to “line” notes may appear in
either the space just above or the one below. For more on this question,
see Sec. 2.3.6.2.1.

CMN also has a facility for describing notes of indefinite but (usually)
short duration, informally called grace notes. Their notation is just like that
of ordinary notes, except that they are smaller and often have slashes through
their stems (Fig. 18). No additional time is allocated for grace notes; they
usually take time from the preceding, but sometimes from the following, note
or rest.
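The arithmetic behind items (2) and (3) above can be checked with exact fractions. This is a sketch under two stated assumptions: the augmentation-dot formula is read as the sum of d × 2^(-i) for i from 0 to n, and a groupet is modelled simply as a number of equal written notes squeezed (or stretched) into a given total duration. The names `dotted` and `groupet_scale` are hypothetical.

```python
from fractions import Fraction

def dotted(d: Fraction, n: int) -> Fraction:
    """Duration of basic duration d with n augmentation dots.

    ASSUMPTION: the formula is read as the sum of d * 2**-i for i = 0..n.
    """
    return sum((d / 2**i for i in range(n + 1)), Fraction(0))

def groupet_scale(count: int, note_value: Fraction, total: Fraction) -> Fraction:
    """Factor applied to each written note when `count` notes of
    `note_value` must together fill `total`."""
    return total / (count * note_value)

half, quarter, eighth, sixteenth = (Fraction(1, d) for d in (2, 4, 8, 16))

print(dotted(quarter, 1))                  # 3/8: one and a half quarter-notes
print(dotted(quarter, 2))                  # 7/16
print(groupet_scale(3, eighth, quarter))   # 2/3: the ordinary triplet
print(groupet_scale(7, eighth, half))      # 4/7: septuplet written in eighths
print(groupet_scale(7, sixteenth, half))   # 8/7: septuplet written in sixteenths
```

Both septuplet factors lie above one half, and only the second exceeds 1, which matches the general statement about groupet durations in item (2).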

2.3.3.1.1. Articulation and Phrase Marks. The actual durations for
which notes are sounded are usually less than the written durations, the
missing time being filled with silence. The fraction of the total written
duration that is sounded is affected by symbols called phrase marks and
articulation marks. The slur is the standard phrase mark. Graphically,
slurs (Fig. 19) are curved lines, in high-quality notation thickened in the
middle; they extend from one note to a later one, frequently many notes
later. Figs. 19a and b show slurs with zero inflection points, Fig. 19c a slur with
one inflection point; the latter two figures involve two staves, as in piano
music. It is quite difficult to describe the set of possible shapes of slurs
precisely, but I will say more about them in Sec. 2.4. As Fig. 19 shows,
slurs can look very much like ties, and, in fact, the possible shapes of slurs
are a superset of those of ties. Their meanings are also related to those of
ties: each note in the compass of a slur should occupy its full written
duration, with silent time reduced to zero. Henceforth in this dissertation
“slur” should be taken to mean “phrase mark, slur or tie” unless otherwise
noted.

Other articulation marks apply to a single note at a time and can, 
among other things, either decrease or increase the proportion of the note’s 
time that is sounded. For examples of articulation marks, see Fig. 33c. 

2.3.3.2. Notes and Rests In Context: Rhythm

The chapter on “Duration and Rhythm” in [STON80] begins with the
statement (p. 81):

In all other chapters, the notational signs and procedures are presented as
isolated phenomena, divorced, with very few exceptions, from any musical
context. This cannot be done with rhythm. For rhythm to be perceptible as
such, a context is essential. Stated more simply: a sixteenth note by itself is
no more than a time value; it becomes a rhythmic phenomenon only in the
context of what happens immediately before and after.

This context dependency makes rhythm one of the most difficult aspects of 
music to discuss or even to define. For our purposes, however, we may
consider any particular rhythm to be a periodic pattern much like a “ruler
function” that describes the relative perceptual salience of notes and rests. Ruler
functions are so called because of the similarity in appearance of a graph of
one to a series of tick marks on a ruler; see Fig. 20.[22] The horizontal axis in
the graph represents time, the vertical axis salience. I will use the term
rhythmic strength to refer to the relative perceptual salience of a given
temporal point in a specified meter.[23] The different rhythmic strengths can well
be considered to establish a hierarchy of levels. Rhythm is expressed in sound
in several more or less subtle ways, the most important of which is by
increasing slightly the loudness of notes that begin at rhythmically strong moments.
It is expressed in CMN primarily by such symbols as time signatures and
barlines and secondarily by modifications to notes such as substituting beams for
flags. Almost all music written in CMN is divided into measures (also called
bars) which are delimited by barlines (Fig. 21), a measure being everything
between two barlines. A time signature or meter signature (Fig. 22) is a
symbol, most often one integer above another[24], that specifies the meter of the
music and thereby indicates two things: the total (relative) duration of, and
the underlying rhythmic structure of, each measure.[25] Unfortunately, the
two things are too complex to be encoded in a straightforward way in the two
integers. For example, 3 above 4 (which we will write “3/4”) means the
duration of the measure is three quarter-notes and the rhythmic structure is that
given in Fig. 23; 4/4 means the measure duration is four quarter-notes (one
whole-note), and the rhythmic structure is that of Fig. 24. 6/8 describes a
meter with the same measure duration as 3/4, but with a different rhythmic
structure, namely that in Fig. 25. These examples suggest another way in
which rhythm is like tick marks on a ruler, namely that divisions on all levels
are usually binary, but need not be. Nonbinary divisions occur in music only
at the top level or two — rarely, at the top three levels — and when they do,
according to most experts, they are always ternary.

2.3.3.2.1. The Classification of Meters. Several terms are in use to
describe the different types of meters; in order to discuss SMUT’s rhythm
clarification and automatic beaming (see Sec. 2.3.3.2.2) capabilities later on
(Sec. 4.4), we will need to know what they are. First, though, we need
another piece of terminology. A beat is one of a series of temporal points
that, in the words of an elementary text, [WINO79], recur regularly for any
given section of music

at a rate of speed that is appropriate or “natural” for the music. Sing a
march or a college “fight song” and clap as you sing. You will probably
find that you are clapping at a rate of about 120 to 140 claps per minute.
You are clapping the beat.

Another definition of “beat” is given in a standard music dictionary
[APEL69]: “the temporal unit of a composition, as indicated by the up and
down movements, real or imagined, of a conductor’s hand.”

[APEL69] says about meters (emphasis mine):

According to whether there are two, three, or four units [i.e., beats] to the 
measure, one speaks of duple (2/2, 2/4, 2/8), triple (3/2, 3/4, 3/8), or 
quadruple (4/2, 4/4, 4/8) meter . . . All these are simple
meters . . . Compound meters are simple meters multiplied by three:
compound duple (6/2, 6/4, 6/8), compound triple (9/4, 9/8), compound
quadruple (12/4, 12/8, 12/16).

This rather confusing terminology may be best illustrated by comparing 
several time signatures, each of which describes a measure with a total 
duration of three quarter-notes: 3/4, 6/8, and 12/16. See Table 1. The
boldface indicates the beat in each case. Taking 6/8 as an example, the
table shows that a measure is composed of two dotted quarter-notes, each
of which is a beat: this is the top level. Each dotted quarter is composed
of three eighths, which is the second level. Each eighth, in turn, is
composed of two sixteenths, which is the third and last level shown in the
table, though one could continue the process indefinitely. Note that across
the three meters in the table the durational units on the top two levels
vary, but in each case the third level consists of 16th-notes.
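The division levels just described, combined with the “ruler function” picture of rhythmic strength from Sec. 2.3.3.2, can be sketched as follows. Each meter is given as the number of parts at each level from the top down, following the description in the text (for 6/8: two dotted quarters, each of three eighths, each of two sixteenths). The numeric strength scale and the name `rhythmic_strengths` are assumptions made for illustration; this is not a description of how SMUT computes anything.

```python
# Rhythmic strength as a generalized "ruler function".
# A meter is a list of division counts from the top level down,
# e.g. 6/8 -> [2, 3, 2]. ASSUMPTION: strength is counted in whole levels.

def rhythmic_strengths(divisions):
    """Return a strength for every smallest-unit position in one measure.

    Strength is the number of levels on which the position begins a unit:
    the downbeat gets len(divisions) + 1, the weakest positions get 1.
    """
    total = 1
    for parts in divisions:
        total *= parts
    strengths = []
    for position in range(total):
        strength = 1      # every position begins a smallest unit
        unit = 1          # size of the current unit, in smallest units
        for parts in reversed(divisions):
            unit *= parts             # move up one level
            if position % unit != 0:
                break                 # not aligned here, so not aligned higher up
            strength += 1
        strengths.append(strength)
    return strengths

METERS = {"3/4": [3, 2, 2], "6/8": [2, 3, 2], "12/16": [2, 2, 3]}
for name, divisions in METERS.items():
    print(f"{name:6s}", rhythmic_strengths(divisions))
```

All three meters fill the same twelve sixteenth-note positions, but the pattern of strong and weak positions differs, which is the sense in which the time signature encodes rhythmic structure as well as measure duration.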

TABLE 1. Time Signatures and Their Usual Structures

Division level   3/4            6/8                  12/16
Top              3 quarters*    2 dotted quarters*   2 dotted quarters
2nd              2 eighths      3 eighths            2 dotted eighths*
3rd              2 16ths        2 16ths              3 16ths

“Division levels” are of the measure. Entries marked * (bold in the original) indicate beats.

[APEL69] does not mention meters with more than four beats, but they
certainly exist; for example, 5/4, 15/16, and 7/8 are sometimes called
mixed meters, since the beats are grouped alternately in twos and threes.

Now we are ready to consider the simple-compound dichotomy. It 
relates to what happens on the first level below the beat: meters are simple
if the top-level division of the beat is into two parts, and compound if the
top-level division of the beat is into three parts. Thus, 3/4 is simple triple;
6/8 is compound duple; and 12/16 is compound quadruple. A good rule of
thumb, used by my program SMUT, is that if the “numerator” is a
multiple of 3 but greater than 3, the meter is compound and three of the
“denominator” durations form a beat. Otherwise the meter is simple and 
one of the “denominator” durations is a beat. Note that this is only a rule 
of thumb, not a decision procedure. None of the standard texts gives any 
rules whatsoever, but the examples all give are compatible with this rule. 
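The rule of thumb stated above is simple enough to write down directly. The sketch below follows the rule as worded in the text; it is not SMUT's actual code, and the name `classify_meter` and the shape of its return value are assumptions. As the text warns, this is a rule of thumb rather than a decision procedure, so mixed meters such as 5/4 simply come out as “simple”.

```python
from fractions import Fraction

# The simple/compound rule of thumb, as worded in the text.
# ASSUMPTION: function name and return shape are illustrative only.

def classify_meter(numerator: int, denominator: int):
    """Return (kind, beats_per_measure, beat_duration_in_whole_notes)."""
    if numerator % 3 == 0 and numerator > 3:
        # Compound: three "denominator" durations form one beat.
        return "compound", numerator // 3, Fraction(3, denominator)
    # Simple: one "denominator" duration is a beat.
    return "simple", numerator, Fraction(1, denominator)

for signature in [(3, 4), (6, 8), (12, 16), (4, 4), (9, 8)]:
    print(signature, classify_meter(*signature))
```

For 6/8 the beat comes out as 3/8 of a whole note (a dotted quarter) and for 12/16 as 3/16 (a dotted eighth), in agreement with the classifications given in the text.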

2 « 3 . 3 « 3 . 2 . Beam** Flags on notes are very often replaced by bcam$ (Fig. 
20). The basic function of beams is to increase the readability of music by 
grouping notes according to the underlying rhythm. In Read’s words 
(READ09, p. 81]: 

No comment is necessary to point out the superior clarity of the beamed 
example below as opposed to the contrasting dense forest of individual 
flags outlining the same pattern (Fig. 27). The visual advantages of 
beaming can be farther illustrated by a comparison of the following two 
versions of a compound two-beat meter (Fig. 28) 






Figure 27. Beams vs. flags 
6/4 time is like 6/8 in consisting of two groups of three units each, not
three groups of two units: both meters are compound duple. But in 6/8 the
units can be beamed; in 6/4, they cannot. One more example: the 3/4 time
passage in Fig. 29a, if written in 6/8 time, would probably be beamed as in
Fig. 29b. However, as usual, there are no hard and fast rules; in
particular, the rhythmic level included by beams is indefinite, but is
influenced by rules like “don’t include too many notes in a beamed
group.”[26] Very roughly, “too many” probably means any number larger than
about 8. In Figure 29b, we beamed together half the measure — three eighths
— but with 32nd notes in 6/8 time, that structure would produce 12 notes in
a group, which would be hard to read (Fig. 30a). One solution would be to
beam together only four-note groups (Fig. 30b). The best solution might be
to “structure” the beams, as in Fig. 30c. This piece of notation
illustrates a distinction that is occasionally useful, between primary
beams (which include the entire beamed group) and secondary beams (which do
not). In Fig. 30c, there is only one primary beam (the long one on the
bottom).
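
The arithmetic behind “too many notes” can be made concrete with a small
Python sketch. It is a hypothetical illustration, not the algorithm used by
SMUT: the function name, the list of candidate rhythmic levels, and the
hard limit of 8 are all assumptions, and the sketch handles only runs of
equal-valued notes.

```python
from fractions import Fraction

def choose_beam_span(candidate_spans, note_value, max_notes=8):
    """Return the longest candidate span that holds at most max_notes notes.

    candidate_spans: rhythmic levels to try, as fractions of a whole note.
    note_value: duration of each (equal) note in the passage.
    """
    for span in sorted(candidate_spans, reverse=True):
        if span / note_value <= max_notes:
            return span
    # Nothing fits: fall back to the shortest level offered.
    return min(candidate_spans)

# In 6/8: half a measure (three eighths) or a single eighth.
levels_6_8 = [Fraction(3, 8), Fraction(1, 8)]
print(choose_beam_span(levels_6_8, Fraction(1, 8)))   # eighth notes
print(choose_beam_span(levels_6_8, Fraction(1, 32)))  # 32nd notes
```

With eighth notes the half-measure group holds three notes and is accepted;
with 32nd notes it would hold twelve, so the sketch falls back to groups of
four, as in Fig. 30b. It does not attempt the structured beams of Fig. 30c.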

There are several more peculiarities of beams; the only one I’ll illustrate
is the so-called fractional beam, where a beamed group includes an isolated
note that requires more beams than either of the adjacent notes (Fig. 31).
The question of which way the fractional beam should point is complex. In
Fig. 31a, there is no ambiguity, since it must stay within the beamed
group. In Fig. 31b, the fractional beams are pointed so that the rhythm is
clearly shown.[27]

2.3.3.2.3. Rhythm Concluded. Movements of real music frequently begin
with partial measures, informally called “upbeats” and formally termed
anacruses (the singular is “anacrusis”).[28]

For a full discussion of some aspects of proper rhythm notation, 
developing the rhythm clarification and fractional beam pointing algo- 
rithms used by SMUT, see Sec. 4.4. 



Figure 30. Beam structure

Figure 31. Fractional beams

Figure 32. Horizontal spacing





2. Notation Systems and Music Notation 




2.3.3.3. The Horizontal Axis

As I have said, time in CMN is represented fairly explicitly along the
horizontal axis. As one might infer from the presence of evenly spaced
horizontal reference lines — the staff lines — but not vertical ones, the
rules here are far less definite than for the representation of pitch on
the vertical axis. The basic idea is that horizontal distances suggest time
intervals but do not specify them: there is no isomorphism. Specifically,
the ideal distance between consecutive time-occupying symbols — notes or
rests — increases with the amount of time separating them, but more slowly
than linearly. It would be too wasteful of space, in a piece that includes
64th notes, to allocate whole notes 64 times more room. We will discuss
specific rules in Chapter 4, “How SMUT Works”. In any case, several factors
result in the continual violation of ideal spacing rules. For example, the
presence of accidentals and dots on short notes may force them to be given
more room than they would otherwise get (Fig. 32). The existence in music
of multiple voices which must be synchronized profoundly affects horizontal
positioning rules; see Sec. 2.3.6.2, “Complications Involving Multiple
Voices”, below.
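
To make “more slowly than linearly” concrete, here is a minimal Python
sketch of one possible spacing function. The logarithmic formula, the
function name ideal_width, and the parameters base and growth are all
assumptions chosen for illustration; they are not the rules SMUT uses,
which are the subject of Chapter 4.

```python
import math
from fractions import Fraction

def ideal_width(duration, shortest=Fraction(1, 64), base=1.0, growth=0.5):
    """Hypothetical ideal space after a note, in units of `base`.

    Width grows with the logarithm of the duration, so it increases with
    duration but more slowly than linearly. Durations are fractions of a
    whole note; a note of duration `shortest` receives exactly `base`.
    """
    ratio = float(Fraction(duration) / Fraction(shortest))
    return base * (1.0 + growth * math.log2(ratio))

whole = ideal_width(Fraction(1))
sixty_fourth = ideal_width(Fraction(1, 64))
# The durations differ by a factor of 64; the widths differ by far less.
print(whole / sixty_fourth)
```

Any monotonic sublinear function would serve the same purpose here; the
logarithm is simply a convenient choice.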

2.3.3.4. A Non-CMN Approach to Time: Spatial Notation

A very different and generally much simpler approach to notation of time
has achieved fairly wide use in 20th-century music. Spatial or proportional
notation is definitely not CMN; however, it is worth briefly describing
here, since it is of some relevance to computer music setting (see, for
example, [MAXW83]). Spatial notation differs from CMN only in notation of
time, in which respect it is somewhat like player-piano rolls: there are no
time signatures and no quarter-notes, eighth-notes, etc.; instead, the
horizontal axis is linear, and time is proportional to distance. Noteheads
are not connected to flags or beams of the ordinary kind, but (unless very
short) are followed by horizontal lines called “extenders” to show how long
they are to be held. Thus, the time component of spatial notation is
clearly not (in the terms of Goodman; see Sec. 2.2.1) a “notational
scheme”: precise interpretation of time values is not possible. For an
extensive discussion of spatial notation, see [STON80].
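
Because the horizontal axis in spatial notation is linear, positioning
reduces to a single multiplication. The following Python sketch shows the
idea; the function name, the representation of notes as (onset, duration)
pairs in seconds, and the scale factor are assumptions made for this
example only.

```python
def spatial_layout(notes, units_per_second=20.0):
    """Place notes on a linear time axis.

    notes: list of (onset_seconds, duration_seconds) pairs.
    Returns a list of (x_position, extender_length) pairs in page units.
    """
    return [(onset * units_per_second, duration * units_per_second)
            for onset, duration in notes]

print(spatial_layout([(0.0, 1.0), (1.0, 0.5), (1.5, 2.0)]))
```

Nothing corresponding to the sublinear spacing of CMN appears here:
doubling a duration simply doubles the length of its extender.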
2.3.4. Notation of Loudness

The notation of loudness in CMN is not only relatively nonexplicit, it is
also relatively simple and undeveloped. The most explicit it gets is the
appearance of verbal comments — usually in Italian and frequently
abbreviated — plus a few special symbols, just above or below the notes
they affect. Most of the common markings with their abbreviations are given
in Table 2. (The “hairpins” are usually horizontal but need not be.) As is
evident, most of the markings simply specify dynamic levels; a few, namely
the hairpins, indicate gradual change from one dynamic level to another.
The scope of markings of the latter type is sometimes indicated clearly,
sometimes not. Thus, if we assume that “gradual changes” are linear, these
markings make it possible to specify loudness as a line-segment function of
time.
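
A line-segment function of this kind is easy to sketch in Python. The
numeric loudness scale (2.0 for piano, 5.0 for forte) and the function name
loudness_at are assumptions made for illustration; CMN itself assigns no
numbers to dynamic levels.

```python
def loudness_at(t, breakpoints):
    """Evaluate a piecewise-linear loudness function at time t.

    breakpoints: list of (time, level) pairs sorted by time. Between two
    breakpoints the level changes linearly; outside them it is held.
    """
    if t <= breakpoints[0][0]:
        return breakpoints[0][1]
    for (t0, v0), (t1, v1) in zip(breakpoints, breakpoints[1:]):
        if t0 <= t <= t1:
            if t1 == t0:
                return v1
            return v0 + (v1 - v0) * (t - t0) / (t1 - t0)
    return breakpoints[-1][1]

# Piano for four beats, then a hairpin crescendo reaching forte at beat 8.
dynamics = [(0, 2.0), (4, 2.0), (8, 5.0)]
print(loudness_at(6, dynamics))
```

The sketch assumes that the scope of every hairpin is known, which, as
noted above, the notation does not always make clear.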


TABLE 2. Dynamic Markings

Symbol     Name            Meaning
ppp        pianississimo   as soft as possible
pp         pianissimo      very soft
p          piano           soft
mp         mezzopiano      moderately soft
mf         mezzoforte      moderately loud
f          forte           loud
ff         fortissimo      very loud
fff        fortississimo   as loud as possible
<          “hairpin”       increase loudness
cresc.     crescendo       increase loudness
>          “hairpin”       decrease loudness
dim.       diminuendo      decrease loudness
< >        “swell”         increase, then decrease loudness
2.3.5. Notation of Timbre

Notation of timbre, i.e., tone quality, in CMN is so nonexplicit that many
works of 19th-century composers do not contain a single timbre indication
other than the bare indication of which instrument (or voice) is to make
which notes, and many works of earlier composers do not even include that.
J. S. Bach’s great Art of the Fugue, written in 1750, contains barely a
hint as to what performance medium he intended. Furthermore, when timbre
notation does appear, it is nearly always expressed by instructions that
tell the performer what to do to produce the desired timbre — not with a
description of the timbre itself.[29] For notational purposes, though, all
that matters is that, like loudness, timbre is notated mostly with verbal
comments above or below the relevant notes. For example, string instrument
music frequently contains the indications “pizzicato” and “arco”:
respectively, “pluck the strings”, and “use the bow” (the latter being the
default). The scope of each of these markings extends until it is
explicitly overridden. However, if a composer wants some notes plucked
while others are bowed, a non-verbal symbol is available: the plucked notes
can be indicated with a little “+” just above or below the noteheads.[30]
There are also one or two types of timbre information that are conveyed by
actually changing the shapes of the noteheads, e.g., to diamonds or “X”s.

Even though we supposedly have now covered the notation of all four
parameters of musical sound, I have just mentioned for the second time (cf.
Sec. 2.3.3.1.1) an important feature of music notation: small symbols
associated with a note that appear just above or below the notehead.
Discussion of this phenomenon really belongs in the next section.

2.3.6. Why Things Are Really More Complicated Than They Seem 

There are several reasons why CMN is really more complicated than the
presentation above suggests, even though the basic principles are as
described. Some of these reasons apply to single-voice music and to
individual voices in multi-voice music; some are a result of interactions
between voices in multi-voice music. Still others are not on the level of
voices at all, but relate to page layout.

I said in Sec. 1.3.1 that most of the really hard problems of music
notation are exacerbated by complexity of texture. The term is difficult to
define. It was adopted from its usage with textiles, where it refers to
“the character of a woven fabric resulting from the arrangement, size,
quality, etc., of the fabric’s threads.” [WORL64] Texture in music notation
has to do with voices and staves and their interrelationships. Any example
of single-voice music has the simplest possible texture. Slightly more
complex is music on any number of staves with exactly one voice on each
staff; more complex still is music on any number of staves with two voices
on some staves; and the most complex textures — and those that are most
difficult to notate — involve staves with three or more voices and/or
voices crossing staves, i.e., not belonging always to the same staff.

Having said this, I must add that the term voice is ambiguous. When, for
example, a pianist plays a melody accompanied by a succession of three-note
chords, the chords might be counted as three voices or as one voice that is
“thickened”. The latter sense is more useful from the standpoint of
notation, and it is usually — though not always — the sense I intend.

2.3.6.1. Complications within a Single Voice

Our “modified coordinate system” model of CMN is further modified by the
fact that certain small symbols can appear above or below any notehead;
these symbols can affect any of the parameters — pitch, time, loudness, and
timbre — of that note alone. Common examples in each category of these
symbols, which we will call “note modifiers”, are given in Fig. 33.[31]

(a) Pitch modification: microtonal (less than a semitone) pitch modifiers.
These are really not standard CMN, however.

(b) Time modification: fermata or hold, staccato mark, tenuto mark. These
last two are articulation marks, which we have mentioned before (in Sec.
2.3.3.1.1).

(c) Loudness modification: accents.

(d) Timbre modification: “+” (plucked string, mentioned above), “0” (open
string), etc.

Figure 33. Note modifiers

Figure 35. Fingered tremolo

A related but more complex phenomenon is that of ornaments, pitch modifiers
whose notation is usually similar to that of the simpler note modifiers.
The main difference is that ornaments are like macros, each one expanding
(often in a very ill-defined manner) into a series of shorter notes whose
total duration is equal to that of the original note. See Fig. 34 for
several common examples, namely one trill and two turns. The trill always
involves rapid alternation between the given note and one a step higher,
but the starting note, the speed of the alternation, and the total number
of notes are undefined. Ornaments are like macros even to the extent of
allowing parameters; the second turn in the figure illustrates this. An
ornament that is notated quite differently is the fingered tremolo (Fig.
35); it is actually a generalization of the trill with an arbitrary
distance between the two pitches.
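
The macro analogy can be illustrated with a short Python sketch of a trill
“expansion”. Everything the notation leaves undefined (starting note,
number of notes) is exposed as a parameter with an arbitrary default; the
function name, the use of integer scale steps for pitch, and the equal
note lengths are assumptions made for this example.

```python
from fractions import Fraction

def expand_trill(pitch, duration, step=1, count=8, start_on_upper=False):
    """Expand one note into `count` equal notes alternating between
    `pitch` and `pitch + step`; the total duration is preserved.

    Returns a list of (pitch, duration) pairs. With step > 1 the result
    is a fingered tremolo rather than a trill.
    """
    if count < 2:
        raise ValueError("an alternation needs at least two notes")
    each = Fraction(duration) / count
    notes = []
    for i in range(count):
        upper = (i % 2 == 0) == start_on_upper
        notes.append((pitch + step if upper else pitch, each))
    return notes

trill = expand_trill(pitch=0, duration=Fraction(1, 4))
tremolo = expand_trill(pitch=0, duration=Fraction(1, 2), step=4, count=4,
                       start_on_upper=True)
print(trill)
print(tremolo)
```

A performer's realization is of course freer than this: the notes need not
be equal in length, and their number is rarely fixed in advance.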

Finally, character strings are used in CMN for many purposes. Three that 
have already been discussed are tempo marks, indications of loudness, and 
instructions related to timbre. Two more usages, both important, are: 

(1) In vocal music, to give the “lyrics”, the words to be sung. This is done
simply by placing them below the appropriate staff, the first letter of
each word or syllable aligned with the note it is sung to, or, if the word
or syllable is sung to several notes, with the note on which it begins.

(2) In any kind of music, for miscellaneous verbal directions to the
performers. These are most often in Italian. Many are indications of the
general character or mood of a composition or passage, for example “dolce”
(sweetly), “leggiero” (light or nimble), “tout devient charme et douceur”
(all becomes charm and sweetness), etc. Other directions to performers
are harder to classify. In keyboard music one finds markings like 

(play with the left hand) and so on. 

The various usages of character strings are differentiated partly by posi- 
tioning and partly by different typefaces. 

2.3.6.2. Complications Involving Multiple Voices

All the complications within a single voice just discussed are relatively
easy both to describe and to deal with. Those involving multiple voices are
neither. Multiple-voiced music can be notated in either or both of two
ways: with multiple staves, and/or with multiple voices on a single staff.
Multiple staves bring in some additional complexity, but the overwhelming
majority of difficult situations arise with multiple voices on a single
staff.
2.3.6.2.1. Two or More Voices Sharing a Staff. Two voices sharing a staff
are basically accommodated by some simple modifications of the standard
rules. For example, stems for the upper voice are always pointed up and for
the lower voice down, and slurs, ornaments, and groupet accessory numerals
all move to the outside. Many problems can arise, however. For example, the
voices can cross, so that the upper voice is temporarily lower in pitch; in
this case interference between symbols in the two voices, requiring some
repositioning, is quite likely. With more than two voices on a staff,
nothing is straightforward. (Bach’s Six-Part Ricercare from The Musical
Offering is a spectacular example: it has three voices per staff almost
continuously.) In fact, variable numbers of voices on a staff and voices
moving from one staff to another are not at all unusual; these occurrences
sometimes lead to ambiguities when fewer than the maximum number of voices
are present. [STON80] includes some excellent discussion of such problems.

One of the most difficult problem areas with multiple voices sharing a 
staff relates to horizontal positioning. 

2.3.6.2.2. The Horizontal Axis Revisited. At this point, we need another
piece of terminology. A hunk is a set of symbols in CMN that includes one
or more notes, all beginning at the same time, and that, from a notational
standpoint, behaves as a “nearly” inelastic unit.[32] That is, the entire
hunk can be moved left or right, but no symbol in it can be moved
independently more than a small distance. (Hunks cannot be moved
vertically, since this would change the pitch of the note(s).) Although the
notes and/or rests in a hunk must all begin at the same time, they may have
different durations and so may end at different times. A complicated hunk
might include several noteheads on a common stem, accidentals and dots for
some, and an accent mark.

Horizontal positioning in multi-voice music is much more complex than in
single-voice music. For one thing, since time is indicated on the
horizontal axis, intuition demands that any note or rest beginning after
another be indicated by its positioning further to the right, even if the
notes are in different voices. In fact, in CMN, this is always done. Fig.
36 (page 85 of an orchestral score, “The Housatonic at Stockbridge” from
Charles Ives’s Three Places in New England (1911; Mercury Music, 1935))
shows an exceptionally complex situation. Look at the rhythm starting one
quarter-note into the measure and ending two quarter-notes in. Here six
different instruments are to play two, three, five, six, seven and eight
notes, respectively, in the same amount of time, and therefore the notation
interleaves two, three, five, six, seven and eight hunks in the same amount
of space. Thus, if the voices were superimposed, their hunks would
interleave in the correct order.
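
The interleaving in the Ives passage can be checked with exact rational
arithmetic. The Python sketch below is only an illustration: the function
name and the representation of each voice by a bare note count are
assumptions, and the sketch computes the left-to-right order of the hunks,
not their actual positions on the page.

```python
from fractions import Fraction

def interleave_onsets(notes_per_voice, span=Fraction(1)):
    """Return the time-ordered columns needed to notate several voices
    that divide the same span into different numbers of equal notes.

    Each result item is (onset, [voices starting a note at that onset]).
    """
    columns = {}
    for voice, count in enumerate(notes_per_voice):
        for k in range(count):
            onset = span * Fraction(k, count)
            columns.setdefault(onset, []).append(voice)
    return sorted(columns.items())

# Six voices playing 2, 3, 5, 6, 7 and 8 notes in one quarter-note.
ives = interleave_onsets([2, 3, 5, 6, 7, 8], span=Fraction(1, 4))
print(len(ives), "distinct horizontal positions for 31 notes")
```

Only onsets that coincide exactly share a column; every other note needs a
horizontal position of its own, strictly to the right of all earlier ones.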

Intuition also suggests strongly that synchronization be indicated by exact
vertical alignment. Again, the rule follows intuition, but in this case
significant exceptions can occur when multiple voices occur on a single
staff. The rules depend on whether the voices have separate stems for their
noteheads or share stems. Fig. 37 illustrates the former case, Fig. 38 the
latter. In each figure, see a for the normal positioning of two
simultaneous notes. b, c, and d illustrate the two basic problem
situations, both involving notes close in pitch, in which standard practice
is to move one notehead to the side. Positioning of the augmentation dots
is also affected here. The first situation (b), where the notes are on
adjacent space and line positions, is relatively straightforward and,
unless the voices cross in two-stem notation, is always handled in the ways
shown. The second situation (c and d), where the notes have exactly the
same vertical position, is more complex: in two-stem notation, if the notes
have the same type of head, same accidental, and same number of
augmentation dots, and both have stems (i.e., they are half-notes or
shorter), then a single head is used with stems going both up and down
(Fig. 37c). If one or more of these conditions is violated, as in 37d and
38c, two heads side-by-side must be used. ([STON80, pp. 125-26] makes the
potential difficulties here quite clear.) The presence of accidentals in
both voices can make matters still worse, as in e, an “altered unison”; in
one-stem notation it is written with the unusual “split stem”. Finally,
consider f, where the noteheads are now far enough apart to be vertically
aligned, but one of the associated accidentals still has to be moved.
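
The conditions for sharing a single notehead in two-stem notation amount to
a simple conjunction, sketched below in Python. The Note class, its field
names, and the string values used for head types and accidentals are
hypothetical; only the four conditions themselves come from the text, and
the notes are assumed to be at the same vertical position already.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Note:
    head: str        # e.g. "filled" or "open"
    accidental: str  # "" when the note has no accidental
    dots: int        # number of augmentation dots
    has_stem: bool   # False for whole notes and longer

def can_share_head(a, b):
    """True if two unison notes in two-stem notation may be written as
    one head with stems going both up and down (cf. Fig. 37c)."""
    return (a.head == b.head
            and a.accidental == b.accidental
            and a.dots == b.dots
            and a.has_stem and b.has_stem)

quarter = Note("filled", "", 0, True)
eighth = Note("filled", "", 0, True)   # flags do not affect the head
half = Note("open", "", 0, True)
print(can_share_head(quarter, eighth))  # same head type: one head
print(can_share_head(quarter, half))    # different heads: side by side
```

When the function returns False, two heads must be placed side by side,
and the sketch says nothing about which of them is displaced.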



Figure 37. Two voices sharing a staff, two stems
a. b. c. d. e. f.

Figure 38. Two voices sharing a staff, one stem

Figure 39. Chopin: Etude, Op. 10 No. 11.
All these problems and more occur in real music. Fig. 39 is an early
example of a split stem, from the Chopin Etude, Op. 10 No. 11 (written
about 1832). A more complicated situation is Fig. 40, from Bach’s famous
Chaconne (from Partita No. 2 in D minor) for solo violin. Here we have (at
least in theory) four independent voices on one staff. In this case the
only way to show the independence of the voices that have stems going up is
to displace one note horizontally a bit. Since the same problem situation
as in Fig. 37b is also involved, we end up with three different horizontal
positions for a single point in time. Notice that the editor of this
edition chose to position the augmentation dot on the top note between the
stems of the next two lower notes, presumably to minimize the horizontal
distances between the three positions. One might, of course, argue that
(notwithstanding the way Bach wrote the manuscript) the voices are really
not independent, since all but the top one have identical rhythms in these
measures, and that therefore several noteheads could be put on one stem;
this would alleviate the difficulties considerably. However, in the Brahms
Intermezzo, Op. 117 No. 1 (Fig. 41), no such solution is possible. Here,
the notation is only trying to show two independent voices per staff (each
“voice” having a chord at this point), and these voices have totally
different rhythms. The difficulties arise for three reasons. First, each
chord involves notes on adjacent line and space positions, so two
horizontal positions are involved within each chord. Second, the “normal”,
stem-direction-determining noteheads are on the left in the top staff and
on the right in the bottom staff, and “the noteheads that determine stem
direction [must be] vertically aligned” [ROSS70, p. 150]; thus the abnormal
noteheads are on opposite sides of the aligned ones, and we are up to three
horizontal positions. Finally, for reasons that are hard to put into words
but obvious from the graphics, the eighth-notes synchronized with the
chords cannot be put in any of these three positions, and we end up with
four different horizontal positions for one temporal point.[33]

These last few examples are all cases where normal symbol positioning 
rules had to be modified to avoid “collisions". Avoiding collisions 
automatically is a difficult problem of computer graphics in general, and we 
will return to it in Sec. 5.2.1. 



Figure 40. Bach: Partita No. 2 in D minor for solo violin. Chaconne, beginning.
2.3.6.2.3. Multiple Staves for One Instrument. I have already mentioned a
few simple instances of notation specific to a musical instrument or family
of instruments. CMN is in fact loaded with instrument-specific features,
some of which are highly complex. One of the most important examples is
keyboard — piano, organ, harpsichord, etc. — notation, which, according to
Read [READ69, p. 306] is “basically geared to the division of labor between
the hands, with every notational element affected in some way by this
consideration.”[34] The most obvious feature of keyboard notation
supporting the “division of labor” is, of course, the use of multiple
staves — normally two, but occasionally three or even four — for a single
instrument. One of the complications that can result from this automatic
interdependence of multiple staves is a voice or voices crossing from one
staff to another; another complication is stems with noteheads on more than
one staff. Several of the “Counterexamples” below, e.g., Figs. 53, 54, and
59, illustrate these and related situations.

2.3.6.2.4. Multiple Voices as a Hierarchy. Multiple-voice music can be
viewed most generally as involving a hierarchy. This structure is important
because, generally speaking, the more closely related two voices are, the
more strongly their notation can interact. This is the main reason that the
slurs jumping staves in Fig. 42 (from the Berg Violin Concerto) are so
surprising. Fig. 43 (from Schönberg’s Pierrot Lunaire), where all three
staves belong to one performer, is much less interesting. The hierarchy is
easy to overlook because one hardly ever sees more than a few of its levels
used simultaneously, but it is partly illustrated by a complex orchestral
score like Fig. 44 (Beethoven, Symphony No. 9, I, p. 1). The possible
levels, from outermost (most inclusive) to innermost (least inclusive),
are[35]:

(1) Ensembles “to perform in different locations of the hall” [STON80, p.
172]. Relatively few compositions (the polychoral music of the 16th
century Venetian School, Bartók’s Music for Strings, Percussion, and
Celesta, etc.) exploit this device.

(2) Families of instruments (brass, strings, etc.). Usually indicated by
large square brackets in the left margin and/or interruptions in
barlines.

(3) Sets of like instruments (horns, violins, etc.). Often indicated by
small square or curly brackets in the left margin.

(4) Individual instruments. Most commonly, each occupies a single staff.
Instruments that take more (e.g., piano, harp) have curly brackets in
the left margin. When two or (rarely) more instruments share a staff,
the rules for multiple voices on a staff given in Sec. 2.3.6.2.1 take
effect.

(5) Voices within an instrument, whether on the same staff or not.

Figure 44. Beethoven: Symphony No. 9, I, p. 1.
2.3.6.3. A Complication Slightly Related to Multiple Voices: Cues

An interesting notational problem arises when, in music for several
performers, which obviously implies multiple voices, one performer has a
long period of silence. Even though CMN conveys rest durations
unambiguously, it is clear that the performer might make a mistake and
resume playing at the wrong time. The notational device called the cue, an
excerpt from another performer’s part printed with smaller than usual
symbols, is intended to alleviate the problem. [DONA63] comments:

Any cue selected to aid in entry should be one that is easily heard by the
player at rest. In many cases it may be the most prominent melodic line,
but it need not necessarily be that; it might well be activity of
immediately surrounding players. For example, if Trombone III has been
resting for thirty-five measures and is to be cued for entry, the cue might
be a slightly earlier entry of Trombones I and II if they, too, had been
silent for a period.

2.3.6.4. Complications Unrelated to Voices: Page Layout

Two further complications in CMN, which I alluded to in the introduction to
this chapter, have nothing to do with voices, but rather are matters of
page layout. One results from a unique fact about music: it is an art that
is usually performed with instruments that occupy the performers’ hands,
and often from written notation rather than from memory. The resulting
complication is that of page turns. In the words of [READ69] (p. 442):

Page turns in instrumental parts can make or break a work. Having to stop
playing to turn a page can quite literally break the continuity of the
music; the fact that the copyist has not provided for logical and
accessible page turns can easily turn the performer against the music
itself.

A comment by William A. Watkins, a commercial music engraver (in the loose
sense of someone who sets music by computer), is even more to the point
[WATK82]:

... the enormous problem of [placing] page turns, particularly when there
aren’t any convenient measures [of] rest at which to place them . . . can
involve knowing which notes can be played with only one hand by the
particular instrument; ensemble seating arrangements (we once had to put an
instruction in a clarinetist’s part to turn the bassoonist’s page); whether
a passage is solo or tutti (in the event that the passage must be
interrupted by a page turn); etc. I wouldn’t care to try to write a program
to encompass all of these things.

(Some parts, e.g., strings in an orchestra, are played largely by several
performers in unison — “tutti” — but with occasional passages played by a
single performer — “solo”.)

As Lawrence Bodony has pointed out [BODO83], putting page turns in a
conductor’s score presents a quite different but probably equally difficult
set of problems. The conductor always has an empty hand, so rests are not
important, but she or he would rather not be confronted with numerous
actions (e.g., cueing performers) to be taken instantly after a page turn.

Finally, there is a very strong tradition in CMN that the music should not 
end just anywhere in the middle of the last page, but should completely fill it. 
There is no obvious reason for this. 

2.4. A COMPARISON OF NOTATION SYSTEMS

Is CMN really that complex? Is it more complex than, say, Chinese writing,
or mathematical notation (to mention two systems with reputations for great
complexity)?[36] This question is too vague to answer meaningfully unless
we first pin down the meanings of the terms “Chinese writing” and
“mathematical notation” as we have already pinned down “music notation” (by
using the better-defined term “CMN”, i.e., “conventional music notation”;
see Sec. 2.2.2).

By “Chinese writing”, or more generally any natural language writing, we
will mean writing of ordinary text; we exclude tables, crossword puzzles,
concrete poetry, and so on, which can extend the typographic
characteristics of the language almost indefinitely. Similarly,
“mathematical notation” herein will exclude geometric diagrams, graphs, and
pictures like the ones commonly used in elementary calculus texts in
discussions of volumes of rotation. These constraints are analogous to the
constraint of restricting music notation to CMN, especially since purely
pictorial representations have been used for music; see [STON80] for
examples.

To answer our question, we also need to distinguish among semantic
complexity, syntactic complexity, and graphic complexity. These three
aspects — semantics, syntax, and graphics — correspond to points along a
line going from most abstract to most concrete. The distinction between
semantics and syntax is, of course, standard. Graphics or, let us say,
typography, is concerned with mapping abstract objects (in English text,
for example, the word “typography”), into concrete ones (for example, the
two-dimensional visual pattern resulting from printing the characters
“typog-” at the end of one line and the characters “raphy” at the beginning
of the next, both in 10-point Palatino).[37] Typography is rarely
considered along with semantics and syntax; however, much linguistics work
is concerned with the auditory equivalent of this abstract-to-concrete
mapping, namely phonetics. In the words of Hofstadter [HOFS83a],

Syntax is a set of rules for mapping structures in a semantic space of high
dimensionality down into a low-dimensional syntactic space. A syntactic
space — usually one-dimensional — is still an abstract space, involving a
sequence of concepts, not tangible or visible objects. Notation is a
further mapping from the abstract syntactic space to the concrete, usually
two-dimensional, typographic space. Syntax is concerned with conversion
from time-independent semantic concepts to a time-dependent sequence of
structures which are still abstract, i.e., nonvisual. The syntactic stage
is a way station between the completely abstract semantic and the
completely concrete graphic levels.

To clarify this, the rules for determining what character in Chinese can
legally occur in a given context are obviously extremely complex (if indeed
such rules exist, which is far from clear): this is a matter of both
semantics and syntax. However, the rules that determine where and how to
draw a Chinese character, once it is decided to use it, are quite simple:
this is a matter of typography. To put it a different way, typography is
what formatters and typesetters (whether human or computer) worry about.
Since a notation is simply a domain-specific set of rules for representing
syntactically ordered information in visual form (see Sec. 2.2.1), it
should be obvious that typographic complexity is the relevant one here.
Therefore the relative complexity of meaning conveyed by a page of Chinese
or of mathematical or music notation need not be argued.

With these points settled, I think our initial questions (“Is CMN really
that complex? Is it more complex than Chinese or mathematics?”) are easily
answered. Chinese is a good example to discuss first because its
typographic complexity appears great (at least to most persons whose native
language is English), but is in fact no greater than that of English —
which is to say, not great at all. Written Chinese (Fig. 45, written in the
recently-developed “simplified characters”) involves some thousands of
rather complicated characters; however, these characters are simply lined
up side by side in lines or columns (both horizontal, as in the figure, and
vertical arrangements are actually used), and when one line or column is
filled, another is begun. Thus, a piece of text in either Chinese or
English is conceptually a one-dimensional string of characters, all roughly
the same size and chosen entirely from a predefined set. The complexities
that text-formatting programs deal with, in either language, arise almost
entirely from the practical requirement of breaking the text into numerous
short segments to make it fit onto pages rather than scrolls.[38] The same
cannot be said of the complexity of music or mathematical notation.

Fig. 46 includes two examples of mathematical notation that appear in the
documentation for a well-known mathematics formatter [KERN75, KERN78a] to
show the capabilities of the formatter. These two examples plus the set of
simultaneous equations in Fig. 47 show a fair amount of complexity; let us
compare the notation in them to the notation in Fig. 36 (the Ives page
discussed previously). Neither is based entirely on a predefined character
set: mathematics has variable-size braces of several kinds, fraction lines,
radical signs, and such symbols as “∫ Σ Π ∪ ∩”; music has note stems, staff
braces of two kinds, slurs, groupet brackets, “hairpins”, and glissando
marks. Each has vertical-axis components that are intrinsic (i.e., not
related to the dimensions of the writing surface) in two different
respects:

(1) Each is composed of multiple horizontal lines of symbols where the vertical
alignment of symbols across all lines is significant. In the set of equations it
indicates correspondence among terms; in the music it indicates notes sounding
at the same time.

(2) Each has, within a single horizontal line, symbols whose meaning depends on
their vertical position relative to some reference point. In the mathematics
these symbols are, of course, the subscripts and superscripts. In the music they
are, primarily, the noteheads, although other symbols’ vertical positions can be
significant.

Figure 46. Mathematical notation

These differences between natural language writing on the one hand, and music 
and mathematics on the other, are very deep. In a word, CMN and mathematical 
notation are fundamentally graphical, while natural language writing is only inciden- 
tally graphical. [39] 

Is there, then, any difference in complexity between music and mathematical
notation? Yes: music notation is more complex, for at least four reasons, all of
which have already been mentioned; see Secs. 2.3.3.1.1, 2.3.6.2.1 and 2.3.6.4. For
one thing, achieving the vertical alignment of the items in the horizontal lines in the
mathematics is trivial: every hunk in every line must be aligned with a corresponding
hunk (or at least a vacant spot, as in Fig. 47) in each of the other lines. In music,
the relative horizontal positions of the hunks can be fairly complex, as in the Ives
example previously discussed.
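
To illustrate why the mathematical case is trivial, here is a minimal sketch of hunk alignment for a set of simultaneous equations. The function name align_rows, the representation of each line as a list of strings, and the use of an empty string for a vacant spot are all assumptions made for this illustration. Each column is simply made as wide as its widest hunk.

```python
def align_rows(rows):
    """Align corresponding hunks of several lines in columns (sketch).

    rows: list of lines, each a list of hunk strings; "" marks a vacant spot.
    Returns one string per line, all of the same total width.
    """
    ncols = max(len(row) for row in rows)
    # Pad short lines with vacant spots so every line has every column.
    padded = [row + [""] * (ncols - len(row)) for row in rows]
    # A column is as wide as the widest hunk that appears in it.
    widths = [max(len(row[c]) for row in padded) for c in range(ncols)]
    return [
        " ".join(hunk.rjust(width) for hunk, width in zip(row, widths))
        for row in padded
    ]


if __name__ == "__main__":
    for line in align_rows([["2x", "+ 3y", "= 7"], ["x", "", "= 12"]]):
        print(line)
```

No column's width depends on anything but its own contents, and no hunk interacts with a hunk in another column. It is exactly this independence that music lacks.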

Second and more importantly, below the hunk level, the horizontal lines in 
mathematics are totally independent; vertically aligning the hunks is all that is 
necessary. [40] In contrast, horizontal lines (i.e., voices) in a musical score repeatedly 
interact at all levels, sometimes to a spectacular extent. (I have already discussed 
one important cause of this interdependence, namely lines on one staff that are close 
in pitch.) A simple example appears in Fig. 48 (Beethoven, Symphony no. 9, Eulen- 
berg edition, III, p. 138) in the second, third, and fourth measures of the clarinet 
staff, where there are two voices on one staff, requiring each to make adjustments. 

The third factor in the greater complexity of CMN is a question of character sets.
As I have said, neither mathematics nor CMN is based entirely on a predefined
character set, but in several respects CMN is much farther from having such a basis.
One of the non-predefined symbols in CMN, the slur, is more complex than anything
I know of in mathematical notation. Slurs not only do not have a fixed size or
orientation, they do not have a fixed anything, other than what might vaguely be
called “slurriness”. An outstanding demonstration of this fact is found in Maurice
Ravel’s Le Tombeau de Couperin (Fig. 49). Among other things, slurriness involves
graceful curviness and forbids self-intersection. It not only does not involve a fixed
shape, it does not involve a fixed or even a limited number of inflection points [41], as
the Ravel example shows. It also does not require continuity (see Figs. 42 and 55,
the latter actually a tie), although “graceful curviness” probably implies several
derivatives at every point of continuity. [42] This last comment touches on a deeper
reason why CMN is further from a fixed character set than is mathematics:
interruptibility is not a special feature of slurs in CMN, but is built into the system!
In fact, almost any line-like symbol in CMN can interrupt another one, or they can
intersect. Numerous examples of these phenomena are cited in Sec. 2.5. In view of all
of this, it would not be an exaggeration to say that, in the ordinary sense of the term
“character set”, there is no such thing as a music character set. CMN does indeed
have many characters (clefs, accidentals, and so on) in the narrowest sense of the
term, but it also has symbols like the slur that can only be called characters in a
very extended sense.

Figure 48. Beethoven: Symphony No. 9, III, p. 136.

The fourth and final difference is related to the “practical requirement” of fitting
the notation on pages instead of scrolls. In this respect mathematical notation is less
demanding than natural language, since individual pieces of mathematics nearly
always fit on a page. CMN, however, is more demanding than natural language. It
has all the line-justification, running header and footer, etc., requirements, plus two
more: page turns, and filling the last page. [43]

I might add one other observation. None of the terms “natural language writing”,
“mathematical notation”, “CMN” is truly well-defined: all three have fuzzy
boundaries, notwithstanding our efforts to pin their meanings down. Therefore the
differences in complexity I have described are more a matter of practice than of
theory. One could probably invent a piece of notation as complex as almost any
CMN and argue with some force that it was legitimate mathematical notation [44];
but no existing mathematics formatting program (for example, Knuth’s TeX
[KNUT79] or the UNIX “eqn” program [KERN78a]) deals with anything analogous to
the above four problems, and I doubt if anyone notices the difference. All serious
music formatters deal with at least the first of the four. None handles the other
three fully, and many users are acutely aware of the lack. [45]

For more evidence of the trickiness of CMN, see the next section.

2.5. COUNTEREXAMPLES AND “FULLY AUTOMATIC HIGH QUALITY
MUSIC NOTATION”

I have now presented evidence that CMN is more complex than such obvious
rivals as Chinese writing or even mathematical notation. I have previously made
strong claims (in Sec. 1.3) for the complexity of CMN in abstract terms, specifically
that the correct formatting of a lot of music requires considerable intelligence
(perhaps beyond the state of the art of artificial intelligence), and the correct
formatting of a small amount of music probably requires human intelligence,
specifically “common sense”, that well-known bugaboo of artificial intelligence. I
promised to give evidence in support of this assertion; it will be presented now,
primarily through an annotated set of examples of published music. I call the items
in this collection “counterexamples” because they are intended to counter the view
that CMN, while it may have many complex details, is in principle easily mechanizable.

One plausible objection to this collection is that these examples really are not
“conventional music notation”, but involve nonstandard extensions to it. This is
impossible to refute absolutely, since the boundaries of CMN (like everything else
about it!) are rather ill-defined. In particular, music notation is an evolving system,
so it might be claimed of any particular example that it includes notation that
really precedes CMN or that is new and has not yet become CMN. In order to
weaken such objections as much as possible, I have included only examples published
by well-known music publishers and only from works of major composers, written
within the 1700-to-1935 period I defined (in Sec. 2.2.2) as CMN. Composers
represented include Bach, Bartók, Beethoven, Berg, Berlioz, Brahms, Chopin,
Debussy, Franck, Grieg, Liszt, Mendelssohn, Mozart, Rachmaninoff, Ravel,
Schönberg, Schubert, Schumann, Scriabin, R. Strauss, and Wagner. Note also that
this collection is far from exhaustive: it is based on an examination of a minute
fraction of the relevant musical literature. [46]

Another possible objection to many of these examples, particularly those classified
as “Collisions”, is that they are simply mistakes by the composer or editor and
therefore show nothing at all about music notation. Again, this cannot be refuted
totally, but in most cases the peculiar usage I am pointing out occurs several times in
the movement. In any case, I think the examples support each other and tend as a
group to undermine this objection: if similar “mistakes” are committed to paper over
and over again by highly reputable composers and publishers, how can anyone insist
that they are mistakes? Who makes the rules?

I want to emphasize that my point is not just that the rules of CMN are highly
complex. Rather, I am making an argument about music formatting analogous to Y.
Bar-Hillel’s well-known argument [BARH60] about natural language translation: that
what he calls “Fully Automatic High-Quality Translation” requires full artificial
intelligence, i.e., human-level intelligence. What I call “Fully Automatic High-Quality
Music Notation” (henceforth, “FAHQMN”) requires human-level intelligence
because:

(1) No set of rules that does not incorporate a high enough degree of flexibility to
resolve a great variety of unanticipated conflicts can possibly handle all of
CMN.

(2) The only way to get that much flexibility is with a system in which rules are
not built-in but emerge from interactions on lower levels where competing forces
can interact in parallel in such a way that superior solutions “float” to the top
[HOFS83e].

(3) Finally, these lower-level pressures can involve the semantics of the music as
well as common sense to an arbitrary extent.

The key word is “unanticipated”: the variety of possible conflicts is so great that
anticipating all situations is out of the question. Consider the Chopin Nocturne in
Fig. 51. What imaginable algorithm would be able to position the beams in the
upper staff as shown, a notation of doubtful “correctness”, but probably the best
that can be done with this music? (One could, of course, handle it with, in the words
of [BARH60], “a completely arbitrary and ad hoc procedure whose futility would
show itself in the next example.”) [47] A few more brief examples should drive the
point home:

(1) Deciding where to place page turns can involve such nonobvious “world
knowledge” as which notes can be played with one hand on the instrument in
question and ensemble seating arrangements (Sec. 2.3.6.4). Deciding what notes
to use as cues presents similar problems (Sec. 2.3.6.3).

(2) Deciding where to change clefs can involve a great deal of musical semantics.
(These last two questions are strongly reminiscent of the text-formatting
problem of avoiding “semantically bad” line breaks discussed by Knuth and
Plass [KNUT81].)

Figure 51. Chopin: Nocturne, Op. 15 No. 2.

Figure 52. Scriabin: Sonata No. 1.

Figure 53. Scriabin: Sonata No. 7.

(3) For the sake of clarity, composers and editors frequently write redundant
accidentals, e.g., a flat in front of a pitch that is flatted in the key signature but
that had a natural in the previous measure, as in Fig. 54. The rules for use of
these “cautionary accidentals” given, e.g., by Stone [STON80, p. 65] are highly
dependent on the semantics of music, especially on the complexity of the harmony.
(All three of these points represent a different situation from the Chopin
Nocturne in that the question here is one of choosing the best of several versions,
all of which are absolutely standard notation.)
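
The simplest case of a cautionary accidental described in point (3) can be written down as a purely mechanical rule, sketched below. Everything in the sketch is an assumption made for illustration: the function name cautionary_candidates, the representation of a note as a (pitch, accidental) pair, and above all the rule itself, which looks only at the previous measure and ignores harmony entirely. It is not Stone's rule, and it is just the sort of fixed rule that the argument above says cannot be sufficient.

```python
def cautionary_candidates(key_signature, previous_measure, current_measure):
    """Naive candidates for cautionary accidentals (illustrative only).

    key_signature: dict mapping a note letter to its accidental, e.g. {"B": "flat"}.
    Each measure: list of (pitch, accidental) pairs such as ("B4", "natural");
    accidental is None when nothing is written in front of the note.
    Returns the indices in current_measure where a reminder might be printed.
    """
    # Pitches that were explicitly altered away from the key signature
    # somewhere in the previous measure.
    altered = {
        pitch
        for pitch, accidental in previous_measure
        if accidental is not None
        and accidental != key_signature.get(pitch[0], "natural")
    }
    seen = set()
    candidates = []
    for index, (pitch, accidental) in enumerate(current_measure):
        # Only the first unmarked occurrence in the new measure is flagged.
        if pitch in altered and accidental is None and pitch not in seen:
            candidates.append(index)
        seen.add(pitch)
    return candidates


if __name__ == "__main__":
    key = {"B": "flat"}
    previous = [("B4", "natural"), ("C5", None)]
    current = [("A4", None), ("B4", None), ("B4", None)]
    print(cautionary_candidates(key, previous, current))
```

A rule like this decides nothing about whether the reminder is actually helpful in context, which is where the musical semantics enter.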

The question is one of what Hofstadter calls “slippability”; see Sec. 5.2.1.4.

I hasten to add that I do not claim that situations requiring commonsense
judgements are as frequent in music formatting as in natural language translation;
they probably are not. But they still occur reasonably often. So it is clear that writing
the “ultimate” music notation system, one that does FAHQMN, would be a
major problem in machine intelligence. [48] To my knowledge no one yet has even
attempted to handle music notation in this way (nor does it make sense to do so in
isolation from other problems).

Most of the examples in the following list are from piano music, which, because
of its textural complexity, is an especially fruitful source of unusual notation. The
examples are divided into several categories. Those in larger print with figure
numbers are quoted here; the others are not. Bibliographic information is given in
the format “(year of composition; publisher/editor, year of publication=code)”. In
many cases I have omitted one of the dates and/or the editor’s name; nonetheless, it
should still be clear from the format what is what. A code is used when a single
publication is referred to more than twice. In this case full information is only given
the first time, but a one- to three-character code is also given that is used in later
references. [49]

A very interesting and somewhat similar set of examples, much less extensive but
with detailed historical discussion, appears in the article “Notation” in The New
Grove Dictionary of Music and Musicians [GROV80]. Another related set of
examples appears in [MAXW83]. [50]

(1) Collisions, i.e., cases where two symbols overlap or touch. See Sec. 5.2.1 for an
extensive discussion of collisions and how a program might avoid them. An
intersection is a collision of two symbols each of which is linear, i.e., conceptually
one-dimensional. Intersections:

Chopin: Nocturne, Op. 15 No. 2 (Henle=H) (stems and beams in different
voices): Fig. 51

Scriabin: Sonata No. 1 (1892; International, 1075=I78), p. 9, 21 (slurs in same
voice): Fig. 52

Scriabin: Sonata No. 7 (1911; I76), p. 136-37 (grace note beams and regular
note beams in the same voice): Fig. 53
See also: 

Bach: B-flat minor Prelude from The Well-Tempered Clavier, Book I (1722;
Kalmus/Bischoff=KB) (stem and tie, stem and beam)

Bartók: Piano Sonata (1926; Boosey and Hawkes), I, p. 9 (ties and stems)

Bartók: Rhapsody for Piano and Orchestra, Op. 1 (1904; Schirmer), p. 8 (hairpin across
many stems)

Berg: Piano Sonata, Op. 1 (1908; Universal 1926?=U26), p. 1, 4, 5 (slur and tie)

Brahms: Rhapsody in G minor, Op. 79 No. 2 (Peters/Sauer=PS) (hairpin and stems,
hairpin and accidentals)

Brahms: Capriccio, Op. 76 No. 5 (PS), p. 18 (tie and stems in same voice)

Chopin: Prelude in E minor, Op. 28 No. 4 (183S; Schirmer/Joseffy=SJ) (slur and
hairpin)

Debussy: Reflets dans l’eau (1905) (slurs in different voices)

Schumann: Piano Concerto (Eulenberg) I, 68, 72, 85-86 (slur and hairpin)

Scriabin: Sonata No. 6 (I76), p. 111 (deliberate intersections of slurs and note
modifiers)

Scriabin: Sonata No. 6 (I78), p. 113 (slurs and stems in same voice)

Scriabin: Sonata No. 9 (na), p. 172 (groupet bracket and slur)

Intersections of barlines and hairpins, barlines and slurs, accidentals and slurs, 
and octave signs and slurs are too common to be worth listing. A few examples 
of collisions other than intersections (most of fairly common types): 

Beethoven: Piano Sonata, Op. 106 (1819; Universal/Schenker, 1923), IV, p. 558
(note head and trill)

Berg: Piano Sonata (U26), p. 11 (performance direction and hairpin)

Berlioz: Symphonie Fantastique (Eulenberg), V (stem and dynamic)

Brahms: Capriccio, Op. 76 No. 8 (PS), p. 29 (slur and character string)

Brahms: Intermezzo, Op. 117 No. 3 (PS), p. 81 (slur and clef)

Mendelssohn: Violin Concerto, I, m. 165 (fermata for one staff and another staff)

Ravel: String Quartet (1903; International), I, p. 19 (tie with key signature and
time signature)

Schumann: Piano Concerto (Eulenberg), I, m. 2488 (stems and accents)

Scriabin: Sonata No. 5 (I78), p. 80 (ledger line, expression mark, hairpin)

Strauss: Also Sprach Zarathustra (1898; Eulenberg=E), p. 25, 29 (dynamic and
slur for different voices)

Wagner: Tristan und Isolde, Liebestod (1885; Kalmus), p. 29 (slur and
performance direction)

(2) Linear symbols interrupted by other symbols. In every case I know of,
the apparent reason is to avoid a collision. Interruptions of barlines are
commonplace. A few examples are:

Beethoven: Symphony No. 9 (1823; Eulenberg), I, p. 1 (barline interrupted by
performance direction)

Mozart: “Dissonant” Quartet, K. 465 (1785; Eulenberg), I, m. 21 (barline interrupted
by dynamic)

Strauss: Also Sprach Zarathustra (E), p. 84 (barline interrupted by instrument
number)

Wagner: Tristan und Isolde, Prelude (Kalmus), p. 5 (barlines interrupted by
performance direction and dynamics)

Other symbols interrupted: 

Berg: Violin Concerto (1935; Universal) (slur interrupted by notehead):
Fig. 42

Debussy: Danseuses de Delphes from Préludes, Book I (1910; Durand=D)
(primary beam interrupted by noteheads): Fig. 54

Franck: Pièce Héroïque (Durand), p. 28 (ties interrupted by notes): Fig. 55

Berg: Piano Sonata (U26), p. 7 (one tie intersecting stems and interrupted by rest)

Brahms: Waltz, Op. 39 No. 2 (International=I) (hairpin interrupted by slur)

Chopin: Etude, Op. 25 No. 8 (1838?; Paderewski/Dover) (hairpin interrupted by
fingering)

Chopin: Nocturne, Op. 15 No. 3 (H) (ties interrupted by notes)

Ravel: Le Tombeau de Couperin (1917; Durand, 1918=D18), Prelude, p. 4 (primary
beam interrupted by clef)

Scriabin: Sonata No. 1 (I76), p. 9 (hairpin interrupted by beams)

Scriabin: Sonata No. 3 (I78), p. 108 (ties interrupted by beams)

(3) Symbols with nonstandard shapes or positions. In most cases the reason
is to avoid collisions; in some the symbol is being used for an unusual purpose.
(It is interesting to observe that when a slur begins or ends with a tied note, the
slur is, according to [READ69], [ROSS70], and [STON80], supposed to
include the entire tied series, but in practice very often does not. The apparent
reason is to shorten the slur and thereby make collision avoidance easier. This
is one of the most glaring discrepancies between CMN theory and practice.)

Figure 55. Franck: Pièce Héroïque.

Figure 56. Bach: B-flat Minor Prelude from The Well-Tempered Clavier, Book I.

Bach: B-flat minor Prelude from The Well-Tempered Clavier, Book I (KB)
(two notes with the same vertical position connected with N-shaped
beam): Fig. 56

Berg: Violin Concerto (1935; Universal) (slur jumping staves and performers;
see [RAST82, p. 234]): Fig. 42

Chopin: Etude, Op. 10 No. 11 (1832?; Paderewski/Dover) (split stem, very
unusual for the time): Fig. 39

Ravel: Le Tombeau de Couperin (D18), Minuet, p. 21 (slur with seven inflection
points): Fig. 49

Schönberg: Pierrot Lunaire (1912; Universal) No. 9 (“Gebet an Pierrot”),
m. 4, piano (slur jumping staves): Fig. 43

Bach-Hess: Jesu, Joy of Man’s Desiring (Oxford, 1926) (noteheads on wrong side of
stem)

Brahms: Intermezzo, Op. 119 No. 1 (PS) (stem only one space long)

Chopin: Prelude in C Major, Op. 28 No. 1 (SJ) (very short stems, thin beams)

Chopin: Sonata No. 3 (Schirmer/Mikuli=SM), I, p. 66 (short stems, very thin beams)

Chopin: Sonata No. 3 (SM), I, p. 70 (slur with two inflection points)

Rachmaninoff: Prelude, Op. 23 No. 4 (International) (slur with two inflection points)

Schubert: Impromptu in E-flat, Op. 90 No. 2 (Kalmus) (step-function ottava sign)

Strauss: Also Sprach Zarathustra (E), p. 141 (bent trill)

(4) Rhythm notation. Caution is necessary here because until about 1750,
rhythm notation conventions were considerably freer than later, especially for
dotted notes. Violation of vertical (rhythmic) alignment (see Sec. 2.3.6.2.1 for
discussion):

Bach: Chaconne from Partita in D minor for Solo Violin (c. 1720) (three stem
and notehead positions for one attack time): Fig. 40

Brahms: Intermezzo, Op. 117 No. 1 (PS) (two stem, four notehead
positions for one attack time): Fig. 41

Chopin: Nocturne, Op. 15 No. 2 (H) (one horizontal position, in fact one
notehead, for regular and quintuplet 16th, for regular and triplet
16th): Figs. 51, 57

Bach: C-sharp minor Fugue from The Well-Tempered Clavier, Book I (KB) (three
notehead positions for one attack time)

Bach-Hess: Jesu, Joy of Man’s Desiring (Oxford) (three stem positions for one
attack time)

Debussy: Les Collines d’Anacapri (D) (three stem positions for one attack time)

Figure 57. Chopin: Nocturne, Op. 15 No. 2.

Other rhythm notation: 

Bach: Goldberg Variations (1742; Peters), No. 26 (change of meter in middle of
measure): Fig. 58 (initially, the upper staff is in 18/16, the lower in 3/4)

Brahms: Capriccio, Op. 76 No. 1 (1879; PS) (written dotted half-note with an
actual duration of eleven 16ths): Fig. 59 (in 6/8 time)

Grieg: Albumblatt, Op. 28 No. 2 (Schirmer) (groupet notation that is
unambiguous but very unusual and evidently pointless: octuplet 16ths are
identical to normal 32nds): Fig. 60

Beethoven: Quartet No. 7 in F, Op. 59 No. 1 (Breitkopf and Härtel), IV, pp.
29ff. (note before barline with dot after barline [51]): Fig. 61

Bach: Cantata No. 1 1 (1713; Dover (Bach Gesellschaft)), p. 86 (note before barline
with dot after barline)

Bach: Mass in B Minor (1732?; Bärenreiter), Symbolum Nicenum, p. 140 (time signature
of slashed “2”, a holdover from 17th-century practice)

Bach: Prelude in D major from The Well-Tempered Clavier, Book II (1744; KB) (double
time signature of slashed “C”, 12/8)

Beethoven: Piano Sonata, Op. 111 (Dover/Schenker), II, p. 604 (change of meter in
middle of measure, hard to recognize because of unmarked triplets and because the
new time signature should really be 3/8, not 6/16)

Brahms: Trio in C minor, Op. 101, III (double time signature of 3/4, 2/4)

Chopin: Prelude in C Major, Op. 28 No. 1 (SJ) (an instructive case: the right hand
has the same rhythm almost throughout, initially notated precisely, but after
a few measures oversimplified)

Chopin: Prelude in D Major, Op. 28 No. 5 (SJ) (eighth-notes starting a 16th-note
before the barline)

Liszt: Sonetto 104 del Petrarca (unmarked groupets; “free” measures of much
greater duration than time signature indicates)

Mozart: Fantasy in C minor, K. 475 (1785; Presser/Broder) (free measures, as above)

Rachmaninoff: Prelude, Op. 32 No. 3 (5 against 4 rhythm in grace notes)

Scriabin: Sonata No. 10 (I76), pp. 198-99 (two time signatures stacked vertically;
dotted barlines sometimes followed by time signatures)

Schumann: Symphony No. 1, “Spring” (Kalmus), III, pp. 88, 102 (change of meter
in middle of measure)

(5) Miscellaneous. 

Debussy: La danse de Puck (D) (two clefs simultaneously active on one
staff): Fig. 62

Debussy: Suite “Pour le Piano” (1901; Durand), I (two clefs simultaneously
active on one staff): Fig. 63

Debussy: Les Collines d’Anacapri (D) (8va and non-8va simultaneously
on one staff): Fig. 64

Figure 60. Grieg: Albumblatt, Op. 28 No. 2.

Berg: Piano Sonata (U26), pp. 8-9 (hairpin across page break)

Brahms: Ballade, Op. 10 No. 3 (1858; I), p. 28 (two clefs simultaneously active
on one staff)

Brahms: Ballade, Op. 10 No. 2 (I) (tie between notehead and beam)

Chopin: Sonata No. 1 (SM), I, last measure (grace note with slur to fermata)

Debussy: La Mer (1905; Kalmus), II, p. 78, harp (beams in mid-air, accompanying
a glissando mark to suggest its speed)

Debussy: Suite Bergamasque (1890; Fromont, 1910), I (two clefs simultaneously
active on one staff)

Debussy: Voiles (D) (two clefs simultaneously active on one staff)

Ravel: Le Tombeau de Couperin (D18), Forlane (slur ending between staves in mid-air)

Scriabin: Sonata No. 5 (I78), p. 90 (clef sharing horizontal space with note)

Scriabin: Sonata No. 6 (I76), p. 108, 111 (tremolo involving note cluster with split stem,
no part of which is vertical)

Strauss: Also Sprach Zarathustra (E), 1st cello, p. 13 (clef sharing horizontal space
with note)


NOTES 


[1] If the reader feels that timbre is somehow more complex than the other parameters, she or he
is certainly correct. More elaborate classification schemes that subdivide timbre are frequently
given, but for the study of CMN, they are quite unnecessary. An excellent recent study of timbre
from an experimental psychologist’s perspective is [GREY75].

[2] This situation is changing. Timbre, especially, plays a much larger role in music of the last 30
years than in earlier music. Music notation is responding by adding a large number of symbols to
describe timbre. See [STON80] for the most authoritative discussion.

[3] A few of the better-known attempts are discussed briefly in [READ69] and in the article
“Notation” in [GROV80].

[4] I refer here only to his instrumental works, not his “electroacoustic” (electronic) ones.

[5] This is not to say that we should ignore the shortcomings of CMN for new music, simply that
if an NMN (“new music notation”) is needed, CMN is not shown thereby to be a bad system.
Times change. For an excellent discussion, see [STON80].



Figure 63. Debussy: Suite “Pour le Piano”, I.

Figure 64. Debussy: Les Collines d’Anacapri.

[6] It is generally agreed that mensural notation, the previous notation system in Western art
music, died out about 1600, but its last vestiges persisted well into the 17th century, so that 1700
is probably a safe starting point. See for example “Notation” in [APEL69]. On the other end of
the period, after World War II, composers started inventing new notation at a rapid rate, so that
1935 is a relatively conservative ending point.

[7] Perhaps the most significant change took place around 1750: the semantics of the augmentation
dot (Sec. 2.3.3.1) became clearly what they are now. Also, the modern scope rules for
accidentals (Sec. 2.3.2) became fixed sometime between 1700 and 1750. A third change, in rules
for clef usage, is mentioned in note 11 below. See [DART63] for a discussion of changes in the
semantics of music notation, many very significant, since the Middle Ages.

[8] A standard dictionary of music, [APEL69], defines notes as “the signs with which music is
written on a staff.” This definition might charitably be described as half-hearted.

[9] Actually, parts frequently include “cues”, brief excerpts from other performers’ parts; see Sec.
2.3.6.3. Also, parts sometimes give the music for two or more related instruments, for example
two flutes.

[10] This nomenclature agrees very well with perception. Two pitches any number of octaves
apart do indeed sound very similar; the reasons for this are not fully understood.

[11] While these conventions have been rigid at any given point in recent musical history, they
have not remained constant. For example, alto and soprano voice are both now written in treble
clef. In the early 18th century, alto voice was written in alto clef, while soprano was written in
soprano clef, which is now totally obsolete.

[12] This is an oversimplification. They are equally spaced in what is now the standard tuning
system for nearly all Western music, “equal temperament”. They are slightly unequally spaced in
almost all of the many other tuning systems that have been used in various places and times.
Even in Western music, the theoretically correct equal spacing is not always used: see note 13
below. For more details, see any text on musical acoustics, e.g., [BACK69].

[13] Again, this is not quite true. On an instrument that quantizes pitch as the piano does there
can be no difference, but good string players, singers, etc., often perform these pairs of notes
slightly differently. The difference between these enharmonic notes again has to do with tuning
systems, and we will discuss it in Sec. 3.2.5. Again see [BACK69].

[14] This statement is not beyond criticism. For example, if a flat appears on an E4 in treble clef
(bottom line of staff), is an E4 later in the same measure and staff in bass clef (one ledger line
above the staff) also flatted? The standard texts say nothing about this. Also, many early
18th-century composers and many 20th-century composers have used the convention that accidentals
are absolutely local and affect only the following note and immediate repetitions of it. However,
we are concerned here with standard practice in CMN.

[15] A much rarer octave sign modifies pitch by two octaves; it differs from the usual one only by
having “15ma” or “16” in place of the “8va” or “8”.

[16] The notehead shapes shown are the usual ones, but other shapes with special meanings
sometimes occur; see Sec. 2.3.5. Also, 64th-notes and rests are the shortest that are at all common,
but 128ths and even 256ths have been used; see for example [READ69, p. 06, 117].

[17] [APEL69] says that the tie, “together with the bar line, is the most conspicuous achievement
of modern notation over the earlier system of mensural notation, where it does not
exist . . . Owing to the nonexistence of the tie in mensural notation, a note equalling five units was
never used in duple meter . . . prior to c. 1600; only in triple meter could such a value be
obtained, by subtracting one from six ... ”

[18] Actually, a triplet need not contain three notes or rests. It must simply contain notes and
rests whose total duration is three times any basic duration. The triplet multiplies each item's
duration by a factor of 2/3. For example, a triplet might contain two eighth-notes and two 16ths,
for a total written duration of three eighths and total effective duration of two eighths.
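
The arithmetic in note 18 can be checked with exact fractions. This is a minimal sketch; the function name groupet_duration and the convention of expressing durations as fractions of a whole note are assumptions made for the illustration.

```python
from fractions import Fraction


def groupet_duration(written_durations, factor=Fraction(2, 3)):
    """Return (total written duration, total effective duration).

    written_durations: durations as fractions of a whole note.
    factor: the multiplier the groupet applies to each item (2/3 for a triplet).
    """
    total_written = sum(written_durations, Fraction(0))
    return total_written, total_written * factor


eighth = Fraction(1, 8)
sixteenth = Fraction(1, 16)

# The triplet from the note: two eighth-notes and two 16ths.
written, effective = groupet_duration([eighth, eighth, sixteenth, sixteenth])
print(written, effective)
```

Here the written total is 3/8 (three eighths) and the effective total is 1/4 (two eighths), in agreement with the note.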


[19] The term is not common; it appears in [DONA63]. Accessory numerals are often omitted
when it is clear what is intended. For example, when 50 consecutive sets of triplet eighths occur,
the “3” is invariably printed only for the first two or three. This is the situation with the
quintuplets in Fig. 51. However, they are sometimes omitted in much more ambiguous contexts; see for
example the second movement of the Beethoven Piano Sonata, Op. 111, and the Liszt Sonetto 104
del Petrarca, both referenced in Sec. 2.5, and the discussion of an example by Bartók in
[STON80] (p. 82-84).

[20] The standard texts here are of varying usefulness. [ROSS70] is hopelessly confusing and
ambiguous. [READ69] and [STON80] are better, though still not all one might want. Both
recommend the “always-shrinking” approach. A book on 20th-century rhythm notation,
[READ78], discusses this and related problems at great length.

[21] [READ69] discusses notes with one, two, and three dots only. However, there is at least one
occurrence of a note with four dots in a well-known work, Hindemith’s Symphony Mathis der
Maler (introduction to the third movement).

[22] The one shown is given by P(x) = 1/n if x is rational and x = m/n in lowest terms and n is a
power of 2, else P(x) = 0.
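
The definition in note 22 can be written out directly. This is a minimal sketch with two assumptions that the note does not settle: x is supplied as an exact Fraction (which Python keeps in lowest terms), and n = 1 is counted as a power of 2, so that whole numbers receive the value 1.

```python
from fractions import Fraction


def p(x):
    """P(x) = 1/n if x = m/n in lowest terms and n is a power of 2, else 0.

    x must be a Fraction; Fraction normalizes to lowest terms with n > 0.
    Assumption: n = 1 (= 2**0) counts as a power of 2.
    """
    n = x.denominator
    # A positive integer is a power of 2 exactly when it has a single 1 bit.
    if (n & (n - 1)) == 0:
        return Fraction(1, n)
    return Fraction(0)


if __name__ == "__main__":
    for value in (Fraction(0), Fraction(1, 2), Fraction(3, 4), Fraction(1, 3)):
        print(value, p(value))
```

Points with small power-of-2 denominators receive the largest values, and points such as 1/3 that fall on no binary subdivision receive 0.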

[23] Most music theorists prefer the term metric strength for this concept. See also note 25 below.

[24] The only other possibilities that are clearly CMN are a large “C” (usually meaning 4/4), and
a large “C” with a vertical line through it (usually meaning 2/2).

[25] The term for the rhythmic structure set up by the time signature and underlying
the music is metric structure. This term helps theorists to distinguish between a composition’s
surface rhythm, which is explicit (i.e., audible) and varying, and its presumed background
rhythm, which is implicit and relatively constant.

[26] None of the standard texts quite makes this statement, but all of them imply it. See
[READ69], p. 80; [ROSS70], pp. 92-93; [STON80], p. 113. [DONA63] gives a more precise but
not very accurate rule; see Sec. 4.4.2.

[27] Again, the standard texts are of uneven quality. All three make almost identically worded,
vague statements: “the fractional beam must point toward the note of which it is a
fraction"; but [STON80]'s examples are relatively good, while [READ69]'s are mediocre and
[ROSS70]'s are totally unhelpful.

[28] These terms are generally used when the partial measure is no more than a single beat in
length. It is not clear whether, strictly speaking, they apply to situations where the partial
measure is longer than this.

[29] This dichotomy has been referred to as the difference between "descriptive" and
"prescriptive" notation. CMN is almost always descriptive of pitch and time, is often descriptive
and often prescriptive of loudness, and is usually prescriptive of timbre. See [SEEG58].

[30] The marking, strictly speaking, means "left hand pizzicato”, i.e., the string is plucked
not with the hand that holds the bow, but with the other hand, the one that ordinarily presses
down the strings to control pitch. The reasons for using the left hand are beyond the scope of
this discussion.

[31] Actually, not all symbols fit neatly into one category. For example, tenuto marks may
indicate a change in loudness as well as in time.

[32] There is no standard term for this concept. The term "hunk" was introduced by Gomberg
[GOMB75, p. 60], although he does not define it clearly. [READ69] occasionally uses the term
"structure" (e.g., p. 134). Knuth’s idea in TeX of "boxes" [KNUT79] is quite similar, especially
in that a box is formed by setting the "glue" connecting its subboxes, so that the contents of the
box become rigidly emplaced. However, his boxes can be nested, while hunks cannot.

[33] In this case, the stem of the chord in the upper staff could have been pointed down, which
would have reduced the number of horizontal positions to three, but one can invent a case where
this could not easily be done: for example, just lower the pitch of the bottom note of the chord.

[34] The labor is divided among feet as well as hands in organ music: it normally has one staff for
the feet in addition to two for the hands. (This is, of course, a footnote.)

[35] It is, unfortunately, nearly impossible to talk about this subject without using terminology
that has strong connotations of either instrumental or vocal music. But I intend to imply no
restriction: the same hierarchic structure occurs in all kinds of music.

[36] I do not claim that Chinese writing and mathematical notation are the most complex
notation systems in existence besides CMN, but they are the most complex I am familiar enough with
to discuss. Other serious contenders might be Arabic writing, Labanotation (a type of dance
notation), and chemical notation.

[37] [FURU82] uses this abstract/concrete model extensively. See Sec. 5.3.

[38] Two complications dealt with by many high-grade text-formatting programs that have
nothing to do with pages are kerning and ligatures; kerning is briefly discussed in note 40 below,
and both are discussed more extensively in Sec. 5.2.1.1. While neither is trivial, they are both of
relatively small complexity and importance. Hofstadter has pointed out an additional complication
[HOFS83a]: Imagine a phrase in a language that is written from right to left, say Arabic, quoted
inside a work in a language that is written from left to right such as English. If a line break is
needed in the middle of the Arabic phrase, it is very unclear what should be done.

[39] Note, however, that the items I excluded from natural language writing at the beginning of
this section (tables, crossword puzzles, and concrete poetry) all have the first of these
“intrinsic vertical-axis components”, and concrete poetry might reasonably have the second.

[40] Scott Kim has pointed out that this is not literally correct. If the expression P n appears in
isolation, the n should probably be “kerned”, i.e., moved under the bowl of the P [inline example
not reproduced], while in a column vector that includes both [...] the [...] would probably be
aligned, so that the n could not be kerned. Again, this effect is trivial compared to what happens
in music notation.

[41] The most common number of inflection points is, of course, zero. Inflection points nearly
always occur when the slur moves from one staff to another.

[42] Some excellent discussion of the mathematics of pleasing curves is in [KNUT79], especially
Part 1, pp. 22-27.

[43] Another way to look at this is in terms of dimensionality. A rigorous analysis of the
dimensionality of any of these systems would probably be extremely difficult, but a few informal
comments may be useful. Natural language writing involves one dimension for its succession of
symbols, and something more, certainly less than a full dimension, for symbol choice. As we have
seen, CMN is in concept four dimensional, the four dimensions being pitch, time, loudness, and
timbre. In practice, only two (pitch and time) are represented to any great extent; on the
other hand, one staff frequently includes more than one line, i.e., more than one set of several
dimensions. Even with the vagueness of all of this, it appears that natural language writing
involves fewer than and CMN considerably more than the two dimensions of the writing surface.
The dimensionality of mathematical notation is still less clear.

[44] A case in point is Fig. 50, from the Physical Review A [DRAK77]. I showed it to one
mathematician with the comment that this was very unusual mathematical notation, to which he
immediately replied "Oh, but that's physics!”, the implication being that therefore it "didn’t
count”.

[45] Why is music notation more complex than mathematical notation? This is a difficult question
to answer, and not really germane to our purpose here; but one likely factor is their differing
attitudes towards macros. Both notations have macros, but music notation allows only predefined
ones (see Sec. 2.3.6.1), while mathematical notation allows user-defined macros, which provide a
syntactically regular way to extend it.

[46] Contributions to this collection are solicited. I would like eventually to publish a much more
comprehensive collection of this kind.

[47] In the words of Fanya Montalvo [MONT83], "generating CMN is an AI-complete problem”.
She coined the term “AI-complete" by analogy with "NP-complete" to suggest that there is a
large class of problems that are not obviously related but whose solutions are in fact all equivalent
to full understanding of human intelligence.

[48] See [HOFS79], especially its comments on the semantic depth of music (pp. 676-77), and
[HOFS82b].

[49] It is, of course, not generally possible to tell from a single published edition whether the
composer or the editor is responsible for a particular piece of notation. It would be interesting to
investigate this question (presumably by the usual musicological techniques of looking at
manuscript sources, comparing published editions where more than one exists, studying the
composer's style, etc.).

[50] I am concerned here with CMN that is typographically unusual. It would also be interesting
to collect CMN that is semantically unusual, for example by being clearly unplayable if taken
literally (every classically-trained musician has seen plenty of music that, for one reason or
another, may be unplayable). Such notation certainly exists, and in the works of major
composers. For example, the Organ Fantasy in G Major by J.S. Bach includes a note that is below
the range of all (known) organs. Another example: it is not too hard to find piano music, e.g.,
Schumann's Dichterliebe Postlude and Brahms' Op. 118 No. 1, Op. 76 Nos. 3 and 8, that includes
swells on individual notes; of course there is no way to increase the loudness of a note on the
piano once it is struck. For a brief discussion of the unplayable-notation phenomenon, see
[RAST82], pp. 235ff.

[51] According to Read [READ69, p. 117], the practice of writing dotted notes with the notehead
before the barline and the dot after it was widely prevalent in the 16th and 17th centuries. Read
says “this practice has long since become obsolete." However, more recent users have included
Bach and Beethoven (examples cited here), Brahms (cited by Read), and, a few years after our
CMN period, Bartók (Sonata for Unaccompanied Violin (1944), fourth movement).

3.1. INTRODUCTION

The roots of automatic music setting, like those of most other recent technological
achievements, go back much further than is usually thought. Buchner [BUCH78, pp.
23-24] mentions a "melograph" invented in the 1750’s for recording performances of
music, its primary application having been to the creation of mechanical musical
instruments. He says, "Attempts to treat the melograph as a typewriter for music
and nothing else did not succeed in practice.”

Of course, it was not until the advent of the modern electronic digital computer
that automatic music setting became practical. Scores, if not hundreds, of computer
systems for music setting have been developed over the last 20-odd years, and I have
been working on mine for more than 14. In this chapter I will summarise my work
and the work of several other researchers on automating music setting from the
1950’s to the present.[1]

One can view the task of music setting in the broadest possible sense as having 
three aspects. Arranged from most abstract to most concrete, they are: 

(1) Selecting: deciding what symbols to print. One might assume that decisions as
to what symbols to print are always determined by the semantics, and therefore
are always explicitly made by the user. There are two reasons why this is not
quite true. The principal one involves system breaks, points where the music
notation is broken into separate systems, determined by the positioning process
described in the next paragraph but nearly always at barlines. The syntax of
CMN is such that system breaks often cause new symbols, particularly slurs, to
be generated. Less important, it is very convenient for the user to be able to
invoke high-level functions of a system that generate symbols to be printed, e.g.,
automatic decisions as to what notes to beam together, or my system's automatic
rhythmic clarification feature, which can replace one note with several (see Sec.
4.4.1). In any case, selecting the symbols to print is a relatively small part of
the task.

(2) Positioning: deciding where to print the symbols and, for some symbols,
deciding on their size, orientation, and/or shape. This aspect, frequently termed
"formatting", is the central part of the task and involves formidable difficulties.

(3) Printing the symbols, i.e., being a "music typewriter" with no concern for the
syntax or the semantics of CMN. (The music typewriter, incidentally, is a very
real device: see Sec. 3.3.2.) This is relatively easy; the only real problems are
related to device independence.

In describing what various systems do, I will sometimes refer back to these three 
aspects. 

One aspect of the "musicians' problem" mentioned at the beginning of this
dissertation is particularly relevant to CMN output systems, namely CMN input. (I mean
here input of the information contained in CMN, not necessarily of the notation
itself, although, with optical scanning, that is one possibility.) As I have already
said, FAHQMN (Fully Automatic High-Quality Music Notation) is far beyond the
state of the art of computer science today. One of the possible compromises (see Sec.
1.3.1) is to abandon full automation, i.e., to support some amount of interaction. In
such a system one cannot always separate the input part of the system from the
output. In any case, editing and formatting of any type of material are closely related
activities, and input can even be considered a type of editing. Much confusion has
been caused by authors (writing mostly about text processing) who have not been
careful enough to make these distinctions. Since in this dissertation I concentrate on
formatting, I will discuss music systems that are primarily input or editing systems
only when they have formatting capabilities that are interesting in themselves.
Otherwise, I may mention them, but will not go into detail.

The breakdown of work into input, editing, and formatting is natural to many
problems in a variety of disciplines, although terminology varies so much from field
to field that the similarities of the tasks are not usually noticed. I will discuss such
questions in Sec. 5.3.

In this chapter, I assume the reader has a certain amount of background
knowledge of computer graphics. For background information, see [FOLE82] or
[NEWM79].

3.2. APPROACHES TO CMN INPUT

Before a program can print out music, it must obviously be given, or must
compute for itself, a precise description of that music in some form. What that form
should be, however, is not so obvious. If the music is to be “fed in" by a person
rather than composed by a program, the question is in some sense the inverse of the
CMN output problem toward which the current work is primarily directed. I say "in
some sense" because the strict inverse of CMN output would be direct pattern
recognition of CMN by computer. Indeed, a fair amount of research has been done on just
this problem. However, several alternatives exist: recognition of digitised sound;
"menu selection"; alphanumeric character languages; and performances on
claviers (piano-type keyboards) or other musical instrument input sections
connected directly to a computer. The borders between these methods are not always
clear-cut, especially between the last two. In my opinion, as of this writing, the only
practical methods are menu selection and those somewhere along the spectrum
between character representations and directly-connected instruments. I will now
discuss all five "pure" methods as well as various compromises.

3.2.1. Pattern Recognition of CMN

Research on computer pattern recognition of CMN by optical means includes,
to date, two dissertations: those of Dennis Pruslin [PRUS66] and David Prerau
[summarized in PRER71]. Pruslin handled only an extremely limited subset of
CMN in just one measure on two staves. Prerau handled a much more substantial
subset of CMN, but still without tempo, dynamic, or phrase markings, with
only one voice on a staff, and again on only two staves. It is interesting to note
that little has been done in this area for more than ten years. One might infer
that this is because the problem has been totally solved, but this is certainly not
the case! The opposite view is much more accurate: the problem has been put
aside by researchers as too difficult to be solved yet, especially in view of the fact
that other methods are usable (and indeed preferable for some applications). This
is so notwithstanding the optimism of a 1972 review of these two dissertations
[KASS72]:

... the logic of a machine that "reads" multiple parallel staffs bearing
polylinear [the reviewer's written-musical analogue of "polyphonic"] printed music
in at least one "fount" [sic] and size can be seen to be no further than another
couple of M.I.T. dissertations away.

(Pruslin and Prerau were both at M.I.T.) This may be correct, but no further
dissertations on the subject, from M.I.T. or elsewhere, have been forthcoming.
The only later attack on the problem to my knowledge is that of Wittlich and
Martin [WITT74], who worked on it until about 1973. In Wittlich’s opinion, the
project was not very successful. They attempted “right off the bat” to handle
much more complex music than either Pruslin or Prerau, namely a full page of
piano music by Debussy, complete with chords, dynamics, slurs, etc. Wittlich
now feels that this was too ambitious, but that the main problem was the
difficulty of obtaining high-quality microfilm for the optical scanner they used,
which was of such high resolution it would “see” every blemish [WITT82].

In view of the tremendous strides technology has made in recent years, the 
time may be ripe for another attack on pattern recognition of CMN. 

3.2.2. Recognition of Music from Continuous Digitised Sound

Another method that appears not to be practical yet is recognition of music
and production of CMN for it from continuous sound. However, the situation
here is very different, in that transcription of sound into CMN has been an active
area of research in recent years. This task is frequently done manually by
musicians under the name “taking dictation". For a computer, the task involves
enormous signal-processing and psychoacoustics problems, somewhat similar to those
of continuous speech recognition, especially with multi-voice music when more
than one note is being played at a time (which, of course, is the usual case).
Determining from continuous digitized sound even so basic a fact as a note's pitch
can be amazingly difficult when one is dealing with the complex timbres of
common musical instruments. Even when the pitches and durations of all the notes
have been determined, the remaining work is far from trivial and, in fact, is
identical in most respects to that of producing CMN from a performance on a
directly-connected instrument; see Sec. 3.2.5. There is one significant difference
between the two methods: low-level acoustic information from earlier phases that
is not available with a directly-connected instrument may be useful in guiding
high-level decisions made after this point of the process. We will see an example
of this shortly.

The very first attack on this problem seems to have been that of James A.
Moorer [MOOR75, MOOR77].[2] He described his work as follows:

A piece of polyphonic [i.e., multi-voice] musical sound is digitized and stored in
the computer. A completely automatic procedure then takes the digitised
waveform and produces a written manuscript which describes in classical musical
notation what notes were played [via Leland Smith's MSS system]. We do not
attempt to identify the instruments involved ... It would appear that it is quite
difficult to achieve human performance in taking musical dictation. To simplify
the task, certain restrictions have been placed on the problem: (1) The pieces
must have no more than two independent voices. (2) Vibrato and glissando must
not be present. (3) Notes must be no shorter than 80 milliseconds. (4) The
fundamental frequency of a note must not coincide with a harmonic of a
simultaneously sounding note of a different frequency. The first three conditions are not
inherent limitations in the procedures, but were done simply for convenience.
The last condition would seem to require more study to determine the cues that
human listeners use to distinguish, for example, notes at unison or octaves ... In
general, the system works tolerably well on the restricted class of musical sound.

Moorer also remarks that “[T]he process is extremely costly in terms of computer 
time." The basic method he used was the “heterodyne filter", actually a directed 
bank of contiguous sharp-cutoff bandpass filters, each of which is intended to pick 
up one partial. In a survey of techniques that he did not find useful, Moorer 
points out that, for musical sound, “we cannot rely on the presence of the funda- 
mental, or on the hope that the fundamental will be stronger than the second 
harmonic." Thus, such obvious techniques as counting zero crossings are unwork- 
able. 
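
Moorer's point about zero crossings can be illustrated numerically. The sketch below is not Moorer's method; it is a minimal Python illustration under assumed values (a 100 Hz fundamental, an 8000 Hz sampling rate, and a second harmonic three times as strong as the fundamental), and the function names are hypothetical. A pitch estimate based on counting zero crossings is correct for a pure sine tone but is an octave too high once the second harmonic dominates.

```python
import math

SAMPLE_RATE = 8000   # samples per second (assumed)
FUNDAMENTAL = 100.0  # Hz (assumed)
PHASE = 0.1          # radians; keeps samples from landing exactly on a zero


def tone(second_harmonic_gain: float, n_samples: int = SAMPLE_RATE) -> list[float]:
    """One second of a fundamental plus a scaled second harmonic."""
    samples = []
    for i in range(n_samples):
        phi = 2 * math.pi * FUNDAMENTAL * i / SAMPLE_RATE + PHASE
        samples.append(math.sin(phi) + second_harmonic_gain * math.sin(2 * phi))
    return samples


def zero_crossing_pitch(samples: list[float]) -> float:
    """Estimate frequency as (sign changes / 2) per second."""
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if a * b < 0)
    duration = len(samples) / SAMPLE_RATE
    return crossings / 2 / duration


print(zero_crossing_pitch(tone(0.0)))  # pure sine: 100.0
print(zero_crossing_pitch(tone(3.0)))  # strong second harmonic: 200.0
```

In the second case the waveform crosses zero four times per period instead of twice, so the estimate doubles even though the fundamental is still present.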

More recently, Piszczalski and Galler [PISZ77, PISZ81] “refined monophonic
[i.e., single-voice] music transcription to a fine art" [MOOR82].

The most recent work I know of is an ambitious effort, still in progress as of
this writing, by Chafe, Foster, and others [CHAF82, FOST82]. They are applying
artificial-intelligence techniques, including some developed for analysis of speech
and other non-musical audio information, to the problem as part of a project to
develop (among other things) “an intelligent editor of digital audio". They take
advantage of the availability of low-level (acoustic) information in a flexible way
to guide high-level decisions: in the words of [FOST82], “The system makes
decisions, but they are not necessarily final. The original sound is always retained
and is reexamined in part from time to time to refine the hypothesis.” Using such
techniques they have implemented “higher-level processing methods [that]
attempt to deduce the musical and notational contexts that have been supplied
by hand in all of the previous systems.” The musical and notational contexts
referred to include the clef, key signature, enharmonic notation (see Sec. 3.2.5),
and time signature.

3.2.3. Menu Selection

Input with the menu-selection method involves selecting CMN symbols from a
menu and constructing a page of music by giving the computer positioning
information for each symbol that requires it (some symbols in some contexts can go in
only one place). The menu may be displayed on a CRT with selections made by
any pointing device, as in several music editing systems (for example, Donald
Cantor’s early one and Rebecca Mercuri’s more recent MANUSCRIPT system
[MERC81]), or it may be in hardcopy form on the surface of a digitising tablet,
as in the editing system of Gary Wittlich and his associates [WITT78].[3] In both
of these systems, feedback is given by displaying the music on the screen of a
CRT while it is being constructed. Encoding a piece of music is done by moving
the cursor to the menu to select a symbol, then (in most cases) moving the cursor
to the desired portion of the music notation to deposit the symbol. This is
repeated ad infinitum, although most systems allow depositing instances of the
same character several times without returning to the menu in between. In a
way, this is not much different from a character representation (see the next
section) where the characters are musical symbols rather than those of, say, ASCII.
The main difference is that with menu selection, position information is not
included in the character code, but instead is given directly by pointing.

Jeffrey Hass has suggested a technique that combines the optical pattern
recognition and menu selection approaches [HASS83]. He suggests that the user
actually draw the music on a tablet and that a program recognize the symbols
drawn and convert them to some code. Ideally, the recognizing program would
provide feedback by displaying each symbol, appropriately cleaned up in both
shape and position, as it is recognized. This is much easier to program than pure
"pattern recognition" of the kind described in Sec. 3.2.1 because symbols are
entered one at a time, so there is no problem of figuring out what is part of one
symbol and what is not, and they are entered on line, so the program can ask for
immediate clarification of an item if it needs it.[4]

3.2.4. Alphanumeric Representations

The simplest and most commonly used method is the one-dimensional
alphanumeric or pure character representation. It requires no special hardware
and is readily handled by conventional programming languages. Unfortunately,
alphanumeric character notation is also the least satisfactory, because, as we saw
in Chapter 2, CMN is very different from and much more complex than the
natural language writing for which alphanumeric notations were originally
developed. Numerous character-based "music input languages”, all of which to
my knowledge are regular languages (in the formal language theory sense of
"regular"), have been developed; for descriptions of several typical ones see [BROO70].
Among the character-based languages discussed in that volume, two that deserve
special mention here are DARMS (originally called the "Ford-Columbia
language") [BAUE70, ERIC75, GOMB75] because it is the best-known, probably
the most complete, and was originally developed for music printing; and
MUSTRAN, because it is the one I use.

The DARMS music input language appeared in about 1965. It was the fruit
of a project intended "to develop a code system that will enable us to print music
under computer control by using currently available photocomposition
equipment" [BAUE70]. DARMS not only is exceptionally complete in covering music
notation, but also appears to have good facilities for dealing with the extensive
repetition often found in music notation. However, it has the (in my opinion)
outrageously poor design feature of using letter names for durations and numbers
for vertical positions, exactly contrary to the universal practice of musicians,
since in practice vertical position means pitch. For example, in DARMS, "Q"
means quarter-note, "E" eighth-note, "S" sixteenth-note, etc. For
thirty-second-notes and all longer durations, the system is relatively mnemonic, but the "S" one
would like for sixty-fourth is already taken, and in fact DARMS abandons any
real attempt at being mnemonic for shorter durations as well. Fig. 1a is a simple
example of CMN; the DARMS code for it is given in Fig. 1b. Sixty-fourth and
shorter durations are relatively rare, so not having good mnemonics for them is
not too serious, but reversing the obvious mnemonic representations for pitch and
duration is surely one of the worst possible mistakes for a music input language.
Imagine a programming language in which identifiers are numeric, while digits are
represented by letters: instead of

AREA := 3.1416*R**2;

one would have to say something like

10 := T.OFOS*6**W;

the absurdity of which is self-evident. The music notation equivalent is not
that bad, but it is bad.[5]

Figure 1. DARMS and MUSTRAN

MUSTRAN was developed by Jerome Wenker [WENK70, WENK74]. The
MUSTRAN code for Fig. 1a is given in Fig. 1c. For a much more extensive
example, see Sec. 4.8. Wenker originally developed MUSTRAN for ethnomusicological
purposes, i.e., for encoding folk music of various origins, so that the original
version lacked means of representing such art-music symbols as slurs, bowings,
fingerings, etc. In MUSTRAN II, however, Wenker added a great deal of
art-music notation; as a result, in terms of representing CMN completely, MUSTRAN
II is probably second only to DARMS. In my opinion, MUSTRAN is also far
more mnemonic than DARMS, and it has better software support (a translator
and a library of utility routines) than any other alphanumeric language. On the
other hand, the current translator is extremely batch-, and even punchcard-,
oriented. Also, MUSTRAN’s facilities for reducing repetitious coding are very
limited. These facilities could, of course, be provided in a preprocessor, although
almost certainly not as well as if they were integrated with the input language in
a single processor.[6] This is especially true of use in an interactive mode, with
feedback in CMN appearing on a screen as the music is being entered, which is
highly desirable.

3.2.5. Directly-Connected Musical Instruments 

By “directly-connected" musical instrument input, I mean that the channel
transmitting information to the computer does not employ sound or its electronic
equivalent (i.e., a signal which, amplified and fed into a loudspeaker, would ideally
produce the same sound). Instead, the coupling is on a higher, more symbolic,
level: the information the computer receives explicitly says what pitches are
being sounded at what times by specifying one of a small repertoire of discrete
possibilities. The difference is that between a player piano roll of a certain piece
and a phonograph record of the same piece.[7] Loudness information is optional.
Doing this is really practical only with instruments that inherently quantise pitch
and time, particularly keyboard instruments. When it is done, however, the
signal-processing and psychoacoustics problems mentioned in Sec. 3.2.2 are
bypassed. (To date, the musical instrument used for direct-connection encoding
has nearly always been a clavier: an electronic organ keyboard, sometimes
sensitive to the velocity or pressure of the keystrokes in order to capture loudness
information. The only other instrument whose use I know of is the guitar, input
from which has recently been developed by New England Digital Corporation
[NEDC83a].) Other nontrivial problems remain, for example, rhythm fitting,
rhythm clarification, voice assignment, and enharmonic notation.

Jef Raskin has described the problem of rhythm fitting, also called
quantization, as follows [RASK80]:

Have a person using a metronome play on some instrument six quarter notes, in
succession, at [the] tempo [quarter-note = 120, i.e., 50/100 second per quarter] . . . The
resulting data might well look like the data in [Table 1], which came from an
experiment conducted with a push-button switch attached to my Apple II
computer. The data was produced from the playing of an experienced musician and yet
is irregular. There are two reasons the results from this very simple piece seem
so ragged. First . . . the actual duration of each note must be shorter than the
indicated length in order to leave a short period of silence [between consecutive
notes of the same pitch] . . . Another reason lies in the normal variations in
human motion . . . The player was thinking of six equal notes, filling a measure as
shown in Fig. 2a. But the computer . . . received a series of rather irregular
numbers. It would take some clever programming to determine that all of these
notes were intended to be the same length. A moderately clever program might
produce the music notation shown in Fig. 2b.

Actually, it is hard to imagine how a program would produce the transcription of
Fig. 2b; what seems to me far more likely is the transcription of Fig. 2c. But
both versions are equally undesirable, and for the same reason: they include
details of the performance that not only were not intended by the performer, but
seriously obscure what was intended: a series of notes of equal duration.[8]
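
The effect Raskin describes can be reproduced from the starting times in Table 1. The following Python sketch is an illustration only: it is not Raskin's program, nor the method of [WITT78], and the grid sizes and function name are assumptions. It snaps each starting time to the nearest multiple of a chosen grid unit. With the grid set to a full quarter-note (50 hundredths of a second) the six notes fall on six consecutive beats, as the performer intended; with a finer sixteenth-note grid the irregularities of the performance survive into the transcription.

```python
# Starting times from Table 1, in hundredths of a second.
STARTS = [0, 63, 101, 167, 210, 268]

QUARTER = 50.0           # hundredths of a second per quarter-note at tempo 120
SIXTEENTH = QUARTER / 4  # 12.5 hundredths of a second


def quantize(times, grid):
    """Snap each time to the nearest grid point; return grid indices."""
    return [int(t / grid + 0.5) for t in times]


beats = quantize(STARTS, QUARTER)
sixteenths = quantize(STARTS, SIXTEENTH)

print(beats)       # [0, 1, 2, 3, 4, 5]
print(sixteenths)  # [0, 5, 8, 13, 17, 21]

# Gaps between successive onsets on the sixteenth grid are uneven.
print([b - a for a, b in zip(sixteenths, sixteenths[1:])])  # [5, 3, 5, 4, 4]
```

The choice of grid is exactly the hard part: a coarse grid recovers the intended rhythm here only because the piece is known to contain nothing shorter than a quarter-note.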

TABLE 1

Note    Starting time    Note length
1       0                32
2       63               34
3       101              37
4       167              22
5       210              28
6       268              30

All times are given in hundredths of a second.

An aspect of the rhythm fitting problem that has been almost totally ignored
is that of automatically quantizing to groupets when appropriate. Nearly all
systems simply generate a non-groupet approximation and require the user to edit
the groupet notation in. As far as I know, the only method that has been
described in print is my own, developed for the Wittlich et al. music editor and
described in [WITT78].

A problem related to rhythm fitting is rhythm clarification; it is discussed at 
length in Sec. 4.4.1 of this dissertation. 

A very difficult but (in practice) less troublesome problem is that of correct
voice assignment, i.e., deciding which notes belong to which voices.[9] One might
be tempted to assume that this can be done simply by ordering the notes by
pitch, that is, assigning the highest note to voice number 1, the next highest to
voice number 2, etc. However, such an assumption is often wrong, especially in
music for several performers; voices often cross.[10] Furthermore, the number of
notes sounding at any one time may vary wildly over the length of a piece — in a
single piano piece, often from zero to eight, and sometimes more. If fewer than
the maximum number of notes are sounding at a given moment, how can one
decide which voices are the active ones? Answering this undoubtedly requires far
greater explicit knowledge of the semantics of music than anyone now possesses.
Still another problem is that in some music, particularly piano music, a voice can
include several simultaneous notes, i.e., chords (see Sec. 2.3.6). The developers of
Mockingbird studied the voice-assignment problem extensively and found it
intractable, at least for piano music [MAXW83]. It is less troublesome than the
rhythm problems discussed above only because it can be circumvented fairly
easily, though at some cost to the user: by having her or him tell the system
what notes belong to what voices. This can be done either as the music is being
entered (for example, by encoding one voice per pass) or at a later time.
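
The failure of assignment by pitch order when voices cross can be shown in a few
lines. This is a minimal sketch in Python under stated assumptions: pitches are
written as semitone numbers with middle C taken as 60, the two melodic lines are
invented for the example, and the function name is hypothetical.

```python
def assign_by_pitch(sounding_pitches):
    """Naive scheme: highest pitch goes to voice 1, next highest to voice 2, etc."""
    ranked = sorted(sounding_pitches, reverse=True)
    return {voice: pitch for voice, pitch in enumerate(ranked, start=1)}

# Two invented lines that cross: the upper voice descends while the lower ascends.
upper_voice = [72, 67, 62]
lower_voice = [60, 65, 69]

# What the naive scheme reports as "voice 1" at each of the three moments.
naive_voice_1 = [assign_by_pitch([u, l])[1] for u, l in zip(upper_voice, lower_voice)]

print(naive_voice_1)                  # [72, 67, 69]
print(naive_voice_1 == upper_voice)   # False: after the crossing, the labels swap
```

Until the lines cross the naive scheme is right; at the third moment it hands the
lower voice's note to voice 1. Nothing in the pitches alone reveals the mistake.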

Still another problem is that of choosing the correct enharmonic notation, also
called note spelling. Should a note be written as C-sharp or D-flat? To musicians,
this is far from a trivial question; neither is it a trivial problem to decide
from a keyboard performance which form to use. The depth of the problem is
perhaps suggested by the fact that it has been studied extensively by a
psychologist, H.C. Longuet-Higgins [LONG78]. He compares the relationship between
C-sharp and D-flat to that between homonyms in natural language. Chafe et al.
[CHAF82] attack essentially the same problem with sound input in their system
discussed in Sec. 3.2.2.

I have by no means covered all of the problems involved in transcribing music
from what is basically a performance. As Raskin points out, “musical notation
contains both more and less information than is contained in the performance”;
many of the problems are consequences of this fact. See his article [RASK80] for
a relatively nontechnical but thorough discussion. Another discussion, with some
excellent examples of CMN for piano that would be very hard to produce
automatically from clavier-type input, appears in the paper on the Mockingbird
system mentioned before [MAXW83].

3.2.6. Real Music Systems and a Comparison of Input Methods

As I have pointed out, the distinction between two of the five
approaches — the character and directly-connected instrument ones — is not at
all clear-cut, and various compromises between them have had considerable
success. Perhaps the most useful way to classify the compromise approaches is with
two related dichotomies, one having to do with the encoding of time, the other
with the encoding of pitch. There are encoding schemes where the entry of
duration (and therefore rhythm) information is proportional to real time, and
encoding schemes where it is not. I will henceforth abbreviate these descriptions
to “real time” and “non-real time”. Similarly, some encoding schemes in essence use
position along a physical line — most often a line across the keyboard of a
clavier — to represent pitch, and some do not. The former method might be
termed by analogy “real pitch”, the latter “non-real pitch”. We may then say
that an encoding method that is both real-time and real-pitch is definitely in the
directly-connected instrument category; a method that is neither, definitely is
not; and a method that is one but not the other, might or might not be. In
actual usage, one combination — real time and non-real pitch — seems never to
occur.[11]

An encoding system that uses real-time and real-pitch encoding has great
advantages in that it reflects in a straightforward way both standard musical
performance methods and CMN, but there are some tradeoffs to consider.
Specifically, abandoning real time eliminates totally the rhythm-related problems
discussed in Sec. 3.2.5, but introduces a major loss of efficiency for the many
potential users who have spent thousands of hours learning to play the
instrument in question fluently and, of course, in real time. We will return to the
efficiency question shortly. Abandoning real pitch is harder to justify except on
the purely pragmatic basis that alphanumeric keyboards with digital interfaces
are much easier to find than claviers or other musical instruments with such
interfaces. However, one can use an alphanumeric keyboard, simply interpreting
the keys in an appropriate way: say, labelling consecutive keys in the middle row
of the keyboard with white-note pitches and appropriate keys in the top row with
black-note pitches. This is crude, but still fairly effective.
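
One way such a labelling might look is sketched below in Python. The particular
keys, the single octave covered, and the pitch names are all assumptions chosen
for illustration; no system discussed here is known to use this exact layout.

```python
# Middle row of the alphanumeric keyboard: consecutive keys are white notes.
WHITE_KEYS = dict(zip("asdfghjk", ["C4", "D4", "E4", "F4", "G4", "A4", "B4", "C5"]))

# Top row: each black note sits above and between its two white neighbours.
# Spelling every black note as a sharp is an arbitrary choice made here.
BLACK_KEYS = {"w": "C#4", "e": "D#4", "t": "F#4", "y": "G#4", "u": "A#4"}

KEY_TO_PITCH = {**WHITE_KEYS, **BLACK_KEYS}

def keys_to_pitches(typed):
    """Translate typed characters to pitch names, ignoring unmapped keys."""
    return [KEY_TO_PITCH[ch] for ch in typed if ch in KEY_TO_PITCH]

print(keys_to_pitches("awsd"))  # ['C4', 'C#4', 'D4', 'E4']
```

The table has to commit to one spelling for each black key, so the sketch also
shows the enharmonic problem in miniature: the key labelled C-sharp here may be
a D-flat in the music being entered.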

Table 2 gives the real-time and real-pitch status of a number of computer
music input and setting systems. One other characteristic is indicated, namely
whether the systems give immediate CMN feedback. “C” following the name of
the system means that it can give CMN feedback, “S”, that it can give feedback
in spatial notation (CMN except for time notation; see Sec. 2.3.3.4). From a user
interface standpoint, CMN input is like many other tasks in that well-designed
feedback can make an otherwise awkward method of interacting with a system,
such as alphanumeric music encoding, quite convenient. It should be noted that
CMN feedback is distinct from CMN output not only conceptually, but usually
also in practice. For feedback purposes, rapid response is much more important
than high-quality printing; for final output, the reverse is true. I will say
something about the hardware implications of this observation in Sec. 3.3.5.

TABLE 2

                    Non-real time       Real time

Real pitch          Dal Molin (C)       McLeyvier 2
                    Scan-Note           Mockingbird (S)
                    McLeyvier 1         Synclavier
                    NEDIT               Wittlich

Non-real pitch      Kornfeld (C)
                    MSS (C)
                    MUSTRAN/SMUT
                    NOTATE

Systems shown in the table are Armando Dal Molin’s system; David McLey’s
McLeyvier in two different modes; Dean Wallraff’s NEDIT, developed at the MIT
Experimental Music Studio [WALL78]; Kjaer’s Scan-Note system; William
Kornfeld’s LISP Machine music editor; my own MUSTRAN/SMUT system; Gary
Nelson’s NOTATE; Leland Smith’s MSS; the Xerox PARC Mockingbird system;
New England Digital Corporation’s Synclavier system; Gary Wittlich and
associates’ system [WITT78]; and the University of Toronto Structured Sound
Synthesis Project’s Ludwig [REEV78]. All of the systems for which references were
not given are discussed, with references, in Sec. 3.4.


Dal Molin’s “Musicomp” terminal, like his older PCS-300, uses his “PCS”
(Pitch-Character-Space) encoding technique. The terminal has a double
keyboard, as shown in Fig. 3. For pitch information, it has an ingenious (and
patented) variant of the clavier with four rows of seven keys each, corresponding
to the white notes in four octaves of a clavier (an additional octave above and
below is also encodable). This might be called “semi-real pitch”. The division of
the keyboard into four one-octave chunks is to facilitate touch typing. A
conventional alphanumeric keyboard is used to specify the character (note, rest, or
other). Each symbol is entered as a combination of a pitch, a character, and a
following series of spaces (hence the “PCS” name), the number of spaces giving
the item’s duration — two for the shortest basic duration in the piece, three for
the next shortest, etc. The device gives feedback in CMN on a CRT as the music
is being entered.

All of the other real-pitch systems use conventional claviers. The McLeyvier
offers the user a choice between real-time and non-real-time encoding; giving the
user this option is very sensible, though the non-real-time technique it uses is (as
of this writing) quite awkward.

Leland Smith’s MSS system points up the importance of good feedback. With
MSS, the user can sit at a graphics terminal typing her or his music in an
alphanumeric language and see it immediately in CMN, so that errors — either
semantic or typographic — can easily be spotted and corrections made on the
spot. The method of making corrections also gives good feedback by indicating
graphically what region is currently being manipulated. Smith comments
[SMIT82]:

I find that about 50% of my time is spent typing in the raw data and 50% is
spent in screen editing. I have hopes of gradually changing this ratio to about
75%:25% but I doubt that it can get much better than this considering all the
variable situations that can occur on a page of music.

One can get some feeling for the efficiency of various encoding techniques from
the fact that it took an excellent pianist, Adrienne Gnidec, about five hours to
encode the Chopin Etude in C Major, Op. 10 No. 1, in MUSTRAN. (See
excerpt — about one-sixth of the piece — in Fig. 4, printed by my SMUT system
from Gnidec’s data. The left hand part is slightly simplified.) She could play the
piece on the piano in less than two minutes, some 150 times faster. This
comparison is, however, very misleading. First, the average encoded performance will
include a few mistakes (by the performer or the rhythm-fitting routines or both),
and in any case Raskin’s dictum that a performance provides both more and less
information than its notation applies; for both reasons, some additional time will
usually have to be spent entering information in another way. An objection of
another sort is that Gnidec could not have played the Etude in two minutes had
she not already spent many hours practicing, not just the piano, but that specific
piece. This is certainly true, although many musicians have already spent large
amounts of time practicing pieces whose encoding is desired; this was in fact the
case with Gnidec and the Chopin Etude.

Figure 4. Chopin: Etude in C Major, Op. 10 No. 1 (set by SMUT).

The Etude is normally printed in four to six pages, so Gnidec’s encoding took
about an hour per page. Dal Molin estimated in 1975 [DALM75] that a page of
music could be encoded in from 5 to 20 minutes in his “PCS” system on the
terminal he was then using. This is probably close to the optimal speed for any
encoding method.

3.3. APPROACHES TO CMN OUTPUT 

The question of an approach to CMN output involves no difficult
“human/machine interface” questions, as CMN input does. It is mostly a question of
what hardware is used to produce the finished “original” pages of music, suitable for
reproduction by conventional processes if desired. (Gomberg [GOMB75, pp. 26-29]
discusses this question; however, I do not feel his comments are very accurate,
particularly now that seven years have elapsed since he wrote them.) I will also
comment here on hardware for CMN feedback, even though, strictly speaking, feedback
is part of the input process; and I will make a few software-related comments.

The choice of output device is not as critical as it might on first glance appear. If
a system is designed with reasonable concern for device independence, it can easily be
modified to use another output device, at least of the same general type as it was
designed for (and in a loose sense of “easily”). Thus the category of output device is
much more important than the exact device. Also, systems that drive random-scan
devices can nearly always be adapted to raster-scan without much trouble, but the
converse is not true. See Secs. 3.3.5 and 3.3.6.

To give some perspective, I will begin by talking about three important
noncomputer approaches to music setting. For more information on these methods and on
the history of music setting, see Ross’s book [ROSS70].

3.3.1. The Standard of Comparison: Plate Engraving 

Traditionally, plate engraving has been the standard of comparison for music
setting. Engraving is still generally considered the best music-setting method, but
it requires a high degree of skill and is clearly a dying art. Most music published
by major music publishers from the very early 1700’s until very recently was
engraved, although movable type was in widespread use for centuries and several
other methods — pen-and-ink autography, transfer processes, music typewriter
(Sec. 3.3.2) — have recently come into wide use. Strictly speaking, “engraving” is
a misnomer, since music engravers do most of their work with sets of punches,
reserving the “gravers” (the cutting tools, which have points of different sizes and
shapes) for stems, hairpins, slurs, etc. A more accurate term for what music
engravers do, occasionally used, is “plate punching”. In any case, the engraver
inscribes the musical symbols into the surface of a soft metal plate, the notation
going from right to left (so that an impression will go from left to right). For
examples of the music engravers’ “character set” or punches, as well as a
thorough discussion of plate engraving, see [ROSS70] (note, however, that Ross
uses the term “engraving” in a highly generic sense for several different methods,
and uses “plate engraving” to refer specifically to the use of gravers, punches, and
metal plates).

3.3.2. Important Precursors: The Music Typewriter

Before discussing hardware for computer output of music, I would like to say a
few things about an earlier and simpler approach to mechanizing the process of
music setting, namely the music typewriter. Ross [ROSS70] discusses music
typewriters at some length; the following is based on information he gives.

The first music typewriter was built around 1910. There are several kinds of
music typewriters, but nearly all are actually modified conventional typewriters,
to which they retain many similarities. Major differences are twofold: the
mechanisms for positioning the carriage horizontally and vertically, and, of course,
the character set. As we have seen, the positioning requirements music imposes
on its symbols are far more complex along both axes than are the requirements of
text. To accommodate this, different music typewriters have a variety of
mechanisms, ranging from keys that move the carriage in tiny increments to (in one case)
relying entirely on manual rotation of the platen. The character set of a typical
music typewriter — the Effinger “Musicwriter” — is shown in Fig. 5; it contains
SB characters. It can use paper with preprinted staves, or staves can be typed 
with the character consisting of five horizontal lines. Notice that some large
symbols such as the treble and bass clef are broken into pieces. Symbols such as slurs
and nonhorizontal beams are not provided, even in sections; these must be added
manually. (Some music typewriters do support slurs in a crude way, by providing
several different arcs that may be used to begin and end slurs. Apparently, one
then types a horizontal line to connect them.)

Figure 5. Music typewriter character set

Figure 6. The Indiana University Computer Music System

At first glance, it may appear that music typewriters are very similar to
ordinary typewriters in that both automate the same low-level portion of the setting
task, namely (in terms of my summary at the beginning of this chapter) Aspect 3:
the actual printing of the symbols. In actuality, conventional typewriters also
handle automatically (with their automatic forward spacing) or
semi-automatically (with the carriage return, tabs, margin stops, and margin bell) a
significant fraction of the simple character positioning required in their domain,
while music typewriters cannot handle automatically any of the very complex
positioning their domain requires. Furthermore, conventional typewriters can
actually print all of the characters needed for tasks of fair complexity, while
music typewriters are lacking (especially in slurs and beams) for all jobs but the
simplest. Thus, conventional typewriters automate all of Aspect 3 and much of
Aspect 2, while music typewriters do not even automate all of Aspect 3. The
seriousness of these limitations is attested to by the fact that most musicians,
composers included, have never even seen a music typewriter.[12] In Ross’ words,

. . . any professional engraver knows that a good hand-copier can put out a page
of music at least twice as fast as any machinist ... It is elementary that much
experience is required — beyond regular musicianship — to achieve proper
notation. In short, one would be well advised to put no stock in salesmen's claims
that their machines are speedy, require little practice and effort, and produce
high quality work, all in one.

3.3.3. Important Precursors: Movable Type

In movable type printing, as with the music typewriter, each character
appears on a separate piece of metal or other material. The form of the character
may either be raised above, in the same plane as, or lowered below the surface; all
three methods have been used. In any case things are arranged so that, when the
piece is pressed against an inked surface, ink will be picked up in the right
places. Printing from movable type was in heavy use for text from the time of
Gutenberg, around 1450, until the rise in recent years of the phototypesetter (see
Sec. 3.3.4). One might therefore guess that someone has tried printing music
from movable type, especially since many of the difficulties of printing music with
a fixed character set are not obvious on first consideration. This guess would be
correct. In fact, music was first set with movable type in the late 15th century
[ROSS70]. Soon thereafter it became the dominant method of music setting, a
position it held until the rise of plate engraving (Sec. 3.3.1). A modern font may
contain from 400 to 500 characters; it supports slurs and beams in the same
“crude way” as do music typewriters, discussed in the previous section. Since a
satisfactory alternative — plate engraving — became generally available, around
1700, the mechanical problems of getting numerous tiny pieces of metal to fit
together well, in addition to the character-set-related problems I have already
discussed, have discouraged the use of movable type except under unusually
favorable circumstances. Such circumstances have generally involved relatively simple
music with extensive text on the same page. For instance, hymnals and musical
examples in encyclopedias have often been printed with movable type. For many
years, all of the musical examples in the Encyclopedia Britannica were set this
way.

Movable type differs from music typewriters in allowing a much larger
character set — a significant advantage — but disallowing overprinting — a major
disadvantage. Limited though music typewriters are, movable type printing is for
most purposes just as bad.[13]

3.3.4. Modern Character-set Devices 

The hardware that has been used for computer printing of music might be
classified broadly into general graphics devices with no built-in orientation
towards music, and devices with “music character sets”. I argued in an
unpublished paper in 1977 [BYRD77b] that “there really is no such thing as a
machine-independent music character set”; this attitude certainly fits my thesis, in Sec. 2.4
of the current dissertation, that CMN is deeply graphical in nature. Nonetheless,
as the previous two sections have shown, extensive use has been made of
music-setting methods involving devices with a limited number of fixed characters. In
computer-based efforts, basically two such devices have been used: the fully
formed character (i.e., non-dot-matrix) impact printer equipped with a special
music character set, and the phototypesetter (often called simply typesetter).

The impact printer has on the order of a hundred music characters, which it
can overprint. This is rather like the music typewriter except for one major
handicap: the impact printer can position characters only at grid points of a rather
low-resolution grid, while the music typewriter has fairly high resolution both
horizontally and vertically. Considering that music typewriters already have serious
problems with such common symbols as slurs and nonhorizontal beams, it is clear
that one should not expect much from the fully-formed character impact printer.

The other character set device — the phototypesetter — is vastly more
flexible, and indeed has already produced very high-quality music notation. For the
last 20 years the phototypesetter has been the standard machine for setting
books, magazines, and newspapers. A detailed discussion of phototypesetters is
far beyond our scope here. Suffice it to say that there are two kinds, called
photomechanical (the traditional kind) and digital (which have been in widespread use
for only a few years). Both kinds produce output on film, not plain paper, and
both have their characters organized into fonts of a hundred or so each. (For
music setting, of course, special fonts are needed.) Only photomechanical
typesetters have been used for music setting thus far; I will discuss them first.
Photomechanical typesetters work by shining light through templates of the
desired characters, one after another; a complicated optical system, usually
combined with motion of the film, then enlarges or reduces the image and deflects it
to the proper place on the film. These devices have relatively large character
sets — perhaps one or two thousand characters — which they can position on a
page with fairly high resolution — typically .002 inch or so — and print with very
high quality. In general, however, this is still not enough to draw really good
slurs and beams, and in both of the projects described below that use
phototypesetters (those of Dal Molin and Watkins) slurs and beams are added by hand,
at least in some cases. Gomberg criticizes phototypesetters on one additional
ground, namely that very large characters such as the staves must be assembled
from smaller characters and that these machines do not have good enough
repeatability to do so satisfactorily. I believe this judgment is now overly harsh,
thanks to recent advances in technology, especially the advent of digital
typesetters.

[SEYB83] surveys the state of the art in digital typesetters. Digital
typesetters are superior to photomechanical ones in nearly all ways except that
photomechanical ones are available for a lower cost. As a result digital machines
have virtually taken over in all applications except those where the cost of the
typesetter is crucial. Digital typesetters are extremely high-resolution
programmable-character-set devices. Many models can have hundreds of fonts,
hence tens of thousands of characters, on line at once: the only limitation is
available mass storage. Some expose the film with lasers, while most use CRTs.
Typical resolution is 1000 points per inch across a width of 8 inches or more.[14]
(The CRT-based typesetters achieve far greater resolution than CRT-based
display terminals are capable of by employing optical systems something like
those of the photomechanical typesetters, so that the image on the CRT at any
moment corresponds to only a small part of the output page.) Most digital
phototypesetters make images with a method that is a hybrid between raster- and
random-scan techniques: they scan the image area in a raster pattern, but rather
than the (ideally zero-diameter) dots that are ordinarily used in raster graphics,
they write variable-length parallel strokes, and they generally do not spend time
traversing blank areas. Digital typesetters show promise of developing into
extremely high-quality general graphics output devices ideal for music setting;
the main thing lacking in current models is a general-purpose interface (current
models have very specialized interfaces designed for typesetting text with limited
graphics of the kind commonly encountered in newspaper and magazine text).

An obvious disadvantage of all devices with characters in hardware, including
photomechanical typesetters, is lack of flexibility: the character set cannot be
changed easily. An advantage of these devices is “offloading the CPU”: all a
computer needs to do to make such a device produce a character is to tell it which
character and where, which can save a substantial amount of computation.
Programmable-character-set devices like the digital phototypesetter offload the
CPU without sacrificing any flexibility, and are therefore very attractive. A still
newer type of hardware that has these advantages is the laser printer. Devices of
this type have much in common with digital typesetters. The laser printer has
major advantages in convenience, since it uses plain paper instead of film, and
lower cost. It has two main disadvantages: lower resolution (current
commercially available models have resolutions of at most 480 points per inch) and, in my
experience, poorer and less consistent contrast.

3.3.5. General Graphics Devices 

The subject of graphics hardware in general is a vast one. I will make only a
very few general comments, plus some specific to music notation, here. For more
background including definitions of the terms I use, see [FOLE82], especially
Chapter 3, or [NEWM79].

For our purposes, two of the standard ways of classifying graphics output
devices are useful: they may be random-scan (also called vector-drawing, stroke, or
calligraphic) or raster-scan, and they may be hardcopy or display (non-hardcopy).

In general, random-scan devices, either display or hardcopy, have one major
advantage: they can have much higher resolution than raster-scan devices.[15]
They have some major disadvantages. They are severely limited in availability of
colors or levels of gray scale. Also, random-scan displays have flicker problems
when the image to be displayed gets too complex, and random-scan hardcopy
devices are (in most applications) much slower than raster devices. Either type of
display can produce a complicated image on a screen in a few seconds or less.

Now, how does all of this apply to music setting? I have already observed
that a computer music-setting system may need CMN for two distinct purposes,
feedback and final output. The number of colors or gray levels available is
irrelevant for final output, since CMN uses neither color nor gray scale. For
feedback, especially in editing, some way of distinguishing part of an image from
another part can be extremely useful in any domain, including CMN; this can be
done with color or gray level, or with other mechanisms — for example, reverse
video or blinking.[16] For final output, hardcopy is required, and resolution is
usually paramount, so the best random-scan devices are preferable to any raster-scan
device. For feedback, a display device is obviously more convenient than
hardcopy, and both types of display are quite fast, so the response-time limit of the
hardware is not an issue; but flicker is. In terms of flicker, music notation is
unusually demanding (an observation, incidentally, I have not seen made before).

On a random-scan display, flicker is, of course, always a potential problem. It
is not ordinarily mentioned as a serious problem of raster displays, and, generally
speaking, it is not. Low vertical-resolution raster displays (say, 256 pixels or less)
nearly always have a fast refresh rate, usually 60 Hz, which eliminates flicker
totally. For higher vertical resolution, Foley and Van Dam describe the situation
[FOLE82, p. 488]:

Many raster displays (as well as commercial TV) use an interlaced scan: all
odd-numbered scan lines are displayed in 1/60 of a second, and all even-numbered
scan lines are displayed in the next 1/60 of a second, for an overall frame time of
1/30 of a second. This interlacing helps to avoid flicker because adjacent scan
lines are displayed 1/60 second apart, even though each scan line is only
displayed each 1/30 second. Flicker will occur for horizontal lines one pixel wide
since they are refreshed only at 30 Hz.

Unfortunately, numerous horizontal lines one pixel wide, and long ones at that, 
are the one thing almost guaranteed to appear in any segment of music: the staff! 
Foley and Van Dam continue: 

. . . many modern systems . . . double the rates, halving the frame time to 1/60
second. This removes all vestiges of flicker, but at the cost of more (and/or
faster) refresh buffer accesses per second and significantly higher-bandwidth
deflection amplifiers. An alternative is of course phosphors with a longer decay
time, but then afterimages become a concern.

3.3.6. Representation and Hardware

A classic survey of hidden-surface algorithms [SUTH74] introduces the terms 
imape spaee and objeet space for two fundamentally different internal representa- 
tions of graphic data. The terms are not widely used outside of hidden-surface 
algorithms, but they will be useful to us. In image space, the desired 2~ 
dimensional picture is represented by a 2-dimensional array with resolution equal
to that with which it is eventually to be displayed; in other words, the
representation is a bitmap, and its only primitive is the pixel. Thus, the
representation is of the desired image and, in fact, is isomorphic to it in
raster-scan form. In object space, the objects that may appear in the picture are
represented with any resolution adequate for the ultimate display (in practice,
usually with much higher resolution than the display). The primitives for a
hidden-surface program are typically polygons (the faces of the objects in the
scene); for music notation, they might be notes, rests, clefs, text, etc. Thus,
the representation is on a higher level. It describes, not the image to be
displayed, but the scene of which it is an image. For most purposes, algorithms
using image-space representations are preferred because they are more efficient.
However, they have the disadvantage of being resolution-dependent, and therefore
one must know in advance the resolution of the display desired. Furthermore, they
are quite impractical for high-quality random-scan devices. There are two reasons
for this. First, image-space representations obviously require execution time
proportional to the number of pixels in the image, and in computer graphics high
quality implies high resolution, i.e., a large number of pixels. A computation
speed that is acceptable for a 512x512 CRT display is likely to be out of the
question for a 15000x15000 page on a mechanical plotter. Second, image-space data
is inherently in raster form, which is generally quite difficult to convert to a
vector form that will really look equivalent on most vector devices.[17] So, if a
program is to drive all kinds of graphic devices, object-space representation is
virtually a necessity. (In spite of all this, it turns out that, for music
setting, some type of auxiliary image-space representation is also very
desirable; see Sec. 8.2.1.)
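
The scaling argument for image-space algorithms can be made concrete with a little arithmetic. The following sketch is only an illustration: it assumes that running time grows linearly with the number of pixels, takes the CRT to be 512 pixels square, and uses a hypothetical helper named `pixel_count`.

```python
def pixel_count(width: int, height: int) -> int:
    """Number of pixels an image-space algorithm must visit at least once."""
    return width * height


# Display sizes taken from the discussion of CRTs and mechanical plotters.
crt = pixel_count(512, 512)
plotter = pixel_count(15000, 15000)

# Assumption: cost is proportional to pixel count, so the ratio of pixel
# counts approximates the ratio of running times.
ratio = plotter / crt

print(f"CRT pixels:     {crt:,}")
print(f"Plotter pixels: {plotter:,}")
print(f"Cost ratio:     about {ratio:.0f} to 1")
```

Under that assumption, a computation that takes one second for the CRT would take on the order of a quarter of an hour for the plotter page.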

3.4. A BRIEF HISTORICAL SURVEY OF COMPUTER MUSIC SETTING 

The following historical survey of computer music setting is based on one that
appeared in a 1974 paper of mine [BYRD74]. In that paper, I classified graphic
output devices in the same way as I did above: into those having music characters
in hardware and those not having them in hardware. I also classified systems in
an obvious way as basically either "batch" or "interactive". I think this
classification is still useful. In fact, it might be argued that the major change
in music setting systems between 1974 and now (1983) is the same as the major
change in natural language text systems in the same period: an overwhelming
preponderance of batch processing has been replaced by a preponderance of
interaction, with hybrid systems (sometimes called "integrated
editor/formatters") now being very important.

David Gomberg has made some interesting philosophical comments on the issue of
batch versus interactive computer music setting. In his dissertation [GOMB75] and
a paper based on it [GOMB77], he describes the design of an ambitious and highly
automated batch system and criticizes others for relying on interaction. The
following quotation [GOMB75, pp. 33-34] makes his position clear.

The eventual goal of this project [Gomberg's] is to take people out of the process
to the fullest degree possible ... It is expected that for certain complex
situations, a small residue of autographic work will be necessary, but that most
compositions of ordinary difficulty will require no hand finishing. [Leland] Smith
takes a different view and interlaces human and computer activities. Permitting
explicit interaction has allowed Smith to produce first level results quickly, and
some of what he has accomplished is quite encouraging. Nonetheless, by not
excluding human activity from the start, he will likely find it difficult to do so
later. There is a very basic reason for removing human activities from the system,
namely that these activities require knowledge and skill in many ways comparable
to those of an engraver or autographer . . .

Gomberg also gives a brief survey of early music setting systems. 

A much more recent and extensive but much more general survey paper is
relevant to this discussion: "Document Formatting Systems" by Furuta et al.
[FURU82]. It is concerned with the preparation of documents of all kinds, which
(at present) means primarily text, with extensions to mathematics, tables, and
line drawings, although it touches on other domains, including music notation.

I shall now begin the survey. The reader should bear in mind that these systems 
have widely differing aims: some were really intended as music editing, not setting, 
systems, in particular "Mockingbird" and Kornfeld's system. See Sec. 5.3.

The first work on computer setting of music I know of was done by Lejaren Hiller
and his associates [HILL65]. They were working on this problem at least as early
as 1961, using the ILLIAC I computer with a modified Musicwriter music typewriter
for both input and output. The main modification to the music typewriter, other
than a digital interface, was installing motors and circuitry to provide discrete
control of the vertical axis. Hiller's was essentially a batch system with the
unusual feature of CMN input. It produced high-quality single-voice-per-staff
scores, but was totally dependent on one-of-a-kind devices both for computing (the
ILLIAC I) and for input/output (the modified music typewriter). Far more
important, however, its software hardly went beyond simple, low-level control of
the hardware capabilities of the music typewriter to automate any part of Aspects
1 and 2 of the music setting process. In fact, this system should be regarded
mostly as a batch editor and hardly as a formatter at all: [HILL65] emphasizes its
value for making corrections without having to redo portions of the music that are
unchanged.

Harry Lincoln's batch program [LINC70] used DARMS input and drove a standard
impact line printer equipped (in the manner discussed in the last section) with a
print chain having music characters. It produced rather low-quality output and was
quite limited in the complexity of music it could handle. With such hardware, as I
have suggested, this is hardly surprising.

To my knowledge the first music setting system that did any real formatting, and
the first using widely available hardware, was that of A. James Gabura
[GABU67].[18] By 1967 his batch system MUPLOT was running on an IBM scientific
computer with alphanumeric input and driving a standard mechanical plotter. It did
single-voice-per-staff scores of high quality but with rather severe limitations.

Another early batch system was that of Norbert Boker-Heil [BOKE72], which
took input in the ALMA language and again used a mechanical plotter. It could do
single-voice-per-staff scores of somewhat lesser quality than Gabura's. This was
perhaps the first batch system whose capabilities approached the point of
practicality.

I have already mentioned David Gomberg's proposed batch system for music setting
[GOMB75, GOMB77]. He planned to have input in the DARMS language. The results of
Gomberg's work are 100-odd pages of narrative text and no CMN output, so his work
must be evaluated very differently from others', most of which have produced very
little text but substantial CMN. Gomberg describes in considerable detail, with
data structures, a method for punctuation (see Sec. 4.5) in rhythmically very
complex contexts, including nested groupets; he gives an algorithm for determining
beam placement and slope that is probably better than any given in any book on
notation (see Sec. 4.2.1); and he proposes a data structure for the overall
internal representation of a piece.

Gary Nelson did some interesting work on music setting in the early and middle 
1970s. His first system, NOTATE1, was written at Purdue University; it did
one-voice-per-staff scores in a notation that differs from CMN in employing
spatial notation of time (see Sec. 2.3.3.4). NOTATE1 produced output on an
electrostatic
plotter, on which it was highly dependent. More recently, at Oberlin College, Nelson 
has developed an integrated system of music programs called MPL, written in APL 
and running on a Xerox Sigma 9 computer [NELS77]. MPL's NOTATE function
produces fairly high-quality scores, apparently with one voice per staff, on a pen 
plotter or graphics terminal.

There is one other dissertation on computer music setting, that of Charles Render
[REND81]. Render's field was music education. Unlike Gomberg, Render did indeed
implement a system, named MUSCOR III, but it is amazingly hard to find out from
his dissertation (most of which is really a user's manual) what it actually does.
On p. 7, Render makes a statement that could be interpreted either as a claim that
his system can draw nearly all of the musical symbols in [ROSS70] (i.e., that it
handles Aspect 3 for all these symbols) or as a claim that his system also
includes virtually everything of any sort in [ROSS70] (i.e., that it handles
Aspect 2 as well). The former claim is fairly impressive. The latter claim, if
that is what he means, is very hard to credit, both because no one else has come
close to doing so and because his dissertation gives almost no evidence of it,
either in implementation details or samples of output (the only CMN in the
dissertation is in very simple examples, and it is not even clear whether these
were set by MUSCOR III). Render uses a very clumsy-looking alphanumeric encoding
language, apparently of his own design, for input. MUSCOR III includes an editor.
Again, it is hard to be sure, but it seems to operate on the encoding language and
in a batch mode only.


Last among batch systems, my own SMUT will be discussed in detail later. 

I turn now to interactive systems. One of the best-known and best music-setting
systems of any kind is Leland Smith's MSS [SMIT73, SMIT78], developed at Stanford
University and long in use at the Center for Computer Research in Music and
Acoustics there. MSS appears to do quite well with simple to moderately complex
music in a batch mode. However, its real glory is its ability, with human
intervention from a graphics terminal, to do just about anything, CMN or
otherwise.[19] Input is with an alphanumeric language of Smith's design and MSS
can give feedback in CMN on a display screen. MSS can do scores with multiple
voices on a staff, although it seems usually to require manual intervention for
correct stem direction, slur position and curvature, and so on; however, it is
sufficiently interactive that this can be done rather easily and quickly. It can
print groupets, slurs, chords, dynamics, octave signs, and a wide variety of
articulation marks and ornaments. It has a good automatic beaming capability. When
its interactive graphics capabilities are used, MSS seems to be fairly
machine-dependent.[20] The visual quality of MSS's notation is very good, although
still noticeably inferior to engraving, for example on text, accent marks, and
slurs. (As I have previously observed (Sec. 2.4), slurs are among the most
difficult symbols in music notation.) It also makes stems too short under some
circumstances. It produces output via electrostatic or pen plotters. Smith's
system is the only one I know of that does page layout automatically in such a way
as to guarantee a full last page (see Sec. 2.3.6.4). In fact, its layout
facilities are exceptionally sophisticated in that they calculate how much space
is needed for every measure in an entire movement before casting off, i.e.,
deciding system breaks. This allows MSS to make much better decisions than the
usual one-system-at-a-time method can in placing system breaks as well as page
breaks. (See Secs. 4.6 and 4.7.) Notation produced by MSS has been published
fairly widely, for example in [MOOR77] and [CHAF82], as well as in numerous
publications of Smith's San Andreas Press. One of the latter is [SMIT79], which
calls itself "probably the first book on music ever to be typeset by completely
computerized means". (A 1970 book referred to on p. A33 of [ROSS70] was probably
not completely computer set.)
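
The advantage of measuring a whole movement before casting off can be illustrated with a small sketch. It is not Smith's algorithm, which is not described here. It only contrasts a one-system-at-a-time (greedy) method with a whole-movement search, under assumptions of my own: measure widths are known in advance, and a layout is scored by the sum of squared unused space per system, last system included. All function names are hypothetical.

```python
from functools import lru_cache


def greedy_breaks(widths, system_width):
    """One-system-at-a-time: fill each system as full as possible."""
    systems, current, used = [], [], 0.0
    for w in widths:
        if current and used + w > system_width:
            systems.append(current)
            current, used = [], 0.0
        current.append(w)
        used += w
    if current:
        systems.append(current)
    return systems


def global_breaks(widths, system_width):
    """Whole-movement search: minimize total squared slack over all systems."""
    n = len(widths)

    @lru_cache(maxsize=None)
    def best(start):
        # Returns (cost, tuple of break positions) for measures start..n-1.
        if start == n:
            return 0.0, ()
        best_cost, best_plan = float("inf"), ()
        used = 0.0
        for end in range(start + 1, n + 1):
            used += widths[end - 1]
            # An overfull system is allowed only if it holds a single measure.
            if used > system_width and end > start + 1:
                break
            slack = max(system_width - used, 0.0)
            rest_cost, rest_plan = best(end)
            cost = slack ** 2 + rest_cost
            if cost < best_cost:
                best_cost, best_plan = cost, (end,) + rest_plan
        return best_cost, best_plan

    _, ends = best(0)
    systems, start = [], 0
    for end in ends:
        systems.append(list(widths[start:end]))
        start = end
    return systems


measure_widths = [4, 4, 4, 4, 4, 4, 4]   # seven equally wide measures
greedy = greedy_breaks(measure_widths, 12)
planned = global_breaks(measure_widths, 12)
print([len(s) for s in greedy])    # the greedy method strands one measure
print([len(s) for s in planned])   # the global method spreads the slack
```

With these numbers the greedy method leaves a single measure alone on the last system, while the whole-movement search never puts fewer than two measures on a system. The same idea extends to page breaks.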

Armando Dal Molin is certainly one of the important pioneers in computerizing
CMN setting. He has described his work in two papers [DALM75, DALM78]. Dal Molin
has been in the music-setting business for many years, and has been working on
automating the process since the 1940's, when he developed (and sold) a music
typewriter of exceptional sophistication. In the early 1950's, he modified it to
operate electrically and added a paper-tape punch "to provide digitized spacing
and line justification". I have already described (in Sec. 3.2.6) his
pseudo-clavier keyboard and "PCS" input language, which he has used in two input
devices of his design. The first is the "PCS-300 Music Keypunch", a descendant of
his music typewriter, with paper-tape output to send data to a minicomputer for
hard copy. The PCS-300 is now obsolete and has been replaced by the "Musicomp"
(formerly called "PCS-500"), a microprocessor-based graphic CRT terminal. Both
devices give the operator feedback in CMN as she or he enters the music, and the
Musicomp has good editing capabilities. Dal Molin produces final output of
essentially engraving quality on an old photomechanical typesetter driven by the
minicomputer. Beams, slurs, hairpin dynamics, etc., are all available on the
Musicomp, but are currently added by hand to phototypesetter output, though Dal
Molin expects to replace his phototypesetter
with a new printer of some kind that will be able to print them [DALM82].
According to Dal Molin, as of 1973, over 1000 pages of music per month were being
produced with his system.

A recent arrival on the music-setting scene is William A. Watkins of MUSIGRAPH.
Watkins is in the commercial music-setting business, using his own system
[WATK82], which runs on Radio Shack TRS-80 personal computers, interactively
handles very complex textures, and produces graphically excellent results. It
seems quite remarkable at first that it is practical to compute high-quality CMN
on such a small and slow computer (the processor is a Z80). Not to gainsay
Watkins's achievement, it is somewhat less impressive when one considers that he
produces hard copy on a (photomechanical) typesetter, which is less demanding than
standard graphics devices for two reasons (both of which were mentioned in Sec.
3.3.4): the computer does not draw symbols itself, since the low-level work is
done by the phototypesetter; and, in Watkins' words, "curved and slanted lines"
(all slurs and, presumably, most beams, hairpin dynamics, etc.) are not done by
machine at all, but are added manually. Regarding input of the music, Watkins says
"Our system is similar in many ways to a modern word processing system in that the
operator, working from a manuscript, 'types' the music which is displayed in music
notation on a CRT."

The next few systems should perhaps be considered to occupy a category of their 
own, which might be described as "clavier-based" formatters. Most have rather lim- 
ited editing capabilities. 

Perhaps the first workers on input from a directly-connected instrument were
Prentiss Knowlton and his colleagues at the University of Utah [KNOW71, KNOW72].
They connected a complete electronic organ, not just a keyboard, to a PDP-8
equipped with a random-scan display. In fact, what they built may well have been
the first integrated computer music system that could accept input either in an
alphanumeric language or from a clavier and that could produce output either in
CMN or via sound synthesis (in this case analog, not digital, from the organ). Its
notation capabilities appear to have been rather limited both in extent and in
quality. One reason for the low quality was that "music symbols [were] constructed
from a small number of lines to minimize flicker on a refreshing [i.e.,
random-scan] display." The system could, however, display music in a
piano-roll-like notation as well as in CMN.

In Denmark, a consulting firm named Dataland, under the leadership of Mogens
Kjaer, has developed a very impressive formatting system called "Scan-Note". The
only published information on Scan-Note I know of is rather sketchy [CMJ79]:

The operator enters pitches through a traditional organ-type keyboard while keying
in supplementary information (duration, legato/staccato, start of slur, etc.) on
an auxiliary typewriter-style keyboard. An additional typewriter is used to input
text, and to further specify the final appearance of individual works.

This input is processed first by a microprocessor, then by a FORTRAN program
running on a large computer. The music is finally drawn on a pen plotter "at
approximately twice the desired final size. Since the Scan-Note does not
accommodate every conceivable notation symbol used today, some symbols are added
by hand during final editing." [CMJ79] includes a page of an orchestral score
produced with the Scan-Note system. It shows most of the basics of CMN: slurs,
articulation marks, dynamics, etc. It involves voice grouping (see Sec.
2.3.0.2.4), staves with two rhythmically independent voices and instances of
two-note chords, all in moderately complex situations. No groupets occur, and the
accompanying text does not mention them. The overall quality is nearly as high as
typical engraving. I have also had access to some unpublished information
[ANDE79], which adds the (unsurprising) datum that Scan-Note does automatic
transposition. Dataland originally offered Scan-Note only in the form of a
service, but has recently attempted to sell packaged systems to music publishers.

New England Digital Corporation has recently made available a music-setting
option for their "Synclavier", one of the best-known digital synthesizers [NEDC82,
NEDC83b]. It can utilize either the Synclavier's keyboard in real time (requiring
the user to play along with a click track) or an alphanumeric language for input.
CMN feedback and editing are done with a medium-resolution graphics terminal and
hard copy is produced on a small dot-matrix printer. The editing capabilities
appear in practice to be quite limited. The system can produce scores with one or
two voices per staff, where each voice can include chords. When notation is done
from keyboard input, groupets are apparently never generated automatically, but
they can be edited in. The same is true of dynamic markings, while slurs are not
available at all. Visual quality is adequate for readability, though far from
engraving quality.

A rather similar system was the "McLeyvier" [CMJ82, GILB82], so called after its
inventor, David McLey. It was intended as a packaged commercial product consisting
of a computer, a digitally-controlled analog synthesizer with clavier, a
medium-resolution (512x512) graphics terminal, and a small plotter or dot-matrix
printer. The McLeyvier was unique in that from the beginning it was intended more
as a tool for composers than performers, and this significantly affected software
developed for it. As I have mentioned before, the McLeyvier supported
non-real-time clavier input in addition to alphanumeric and real-time clavier
input. Its notational capabilities were similar to the Synclavier's, and were
intended to work largely automatically: only a small amount of editing could be
done on its notation. The McLeyvier was briefly available for purchase, but is now
being redesigned under new auspices (Syntronics of Toronto) with an all-digital
synthesizer and many improvements, including much more powerful notation editing
[SPIE83a, SPIE83b]. It will also have a new name, not yet decided on.

The "Mockingbird" system [MAXW82, MAXW83, ROAD81], recently developed
at Xerox Palo Alto Research Center by John Maxwell and Severo Ornstein, is a very
interesting variation on the real-time clavier-based formatter. An overview is
given in the paper "Mockingbird: A Composer's Amanuensis" [MAXW83]:

Mockingbird is a composer's amanuensis, a computer program designed to aid the
composer with the capture, editing, and printing of musical ideas . . . Mockingbird
is not a publisher's aid, although it does print music, nor is it a performer's
aid, although it can play; it is strictly focused on the composer's need for a
powerful scribe.

Mockingbird is an interactive music notation editor. It knows nothing about the 
rhythmic, harmonic, or melodic aspects of music except inasmuch as they are 
represented in common music notation. To narrow the problem, we have concentrat- 
ed on handling piano music. Mockingbird cannot presently handle orchestral scores 
or music for instruments that require their own notational devices. 

Mockingbird was written in 1980, and it is somewhat surprising that no one had
previously built such an obviously interesting system. We believe that there are two
principal reasons: first, we had at our disposal, for the first time, an unusually 
powerful set of hardware and software facilities with excellent graphics capabilities, 
and second, we made a number of key decisions . . . which allowed us to bypass some 
extremely difficult problems. 

The paper discusses the problems this quotation refers to; one is the
voice-assignment problem, discussed in Sec. 3.2.5 of the current dissertation.
Mockingbird runs on the Dorado, a very high-performance personal computer
developed at Xerox PARC. All of the software is written in the PARC experimental
language Mesa. Mockingbird may optionally use a Yamaha CP-30 electric piano to
record or play back music.

When music is played into Mockingbird, a score is initially produced (and
displayed on a high-resolution raster-scan terminal) in a simple "piano-roll"
spatial notation (Sec. 2.3.3.4). Very powerful commands are provided that allow
interactive editing of all aspects of this score as well as converting it into
CMN. The editing facilities include a menu selection mode which can also be used
to enter music. Ornstein has written of Mockingbird [ORNS82]:

[We] wish there were an easy way to get Mockingbird "out" [for use outside of
Xerox]. But there isn't. It's based on a giant pyramid of experimental hardware
and software both of which are extremely powerful and permitted us to take a major
step forward relatively easily (less than 1 man-year's work).

An early interactive system was Donald Cantor's CRT-based music editor
[CANT71]. It was based on the earlier work of W. B. Barker, whose program may
have been the first music system to use display terminals. Cantor's system ran on
an early minicomputer, a PDP-1 at Harvard University, and employed no fewer than
four CRTs simultaneously. It was limited to two-voice music with each voice on a
separate staff. Music was entered by the "menu selection" method (Sec. 3.2.3). The
system could play back music (by generating square waves of the appropriate
frequency), and was therefore similar in many ways to Knowlton's contemporary
system described above. The major difference, from our perspective, was input
method.

William Kornfeld of M.I.T. has developed a "semi-interactive" music-editing
system for the M.I.T. LISP Machine. The LISP Machine has a fairly high-resolution
(about 800x1000) bit-mapped display and "mouse" pointing device as standard
equipment. This hardware, together with the LISP Machine's sophisticated
SMALLTALK-like multi-window user interface, automatically supports graphics
programs with a significant amount of interactivity. Nothing has been published
about Kornfeld's system except for two typical screen displays reproduced on the
covers of a journal [KORN80]. The following information was given to me directly
by Kornfeld [KORN82]. His program is intended primarily as an editor, not a
formatter. It works in image space with a bit map. Input can be in any of three
forms: alphanumeric, real-time clavier, or non-real-time clavier. To edit music,
the user marks the region of interest on the screen, using the mouse. The
principal editing operations are "insert", "delete", "transpose" (in the musical
sense), and two rhythm-changing operations: "augment" (increase the durations by a
constant factor) and "diminish" (decrease the durations by a constant factor).
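
The three region-wide operations just named are simple enough to sketch. The code below is an illustration only and says nothing about how Kornfeld's program is actually written. The `Note` record, the use of semitone numbers for pitch, and the use of fractions of a whole note for duration are all assumptions of mine.

```python
from dataclasses import dataclass, replace
from fractions import Fraction


@dataclass(frozen=True)
class Note:
    pitch: int           # semitone number (assumed encoding; 60 = middle C)
    duration: Fraction   # fraction of a whole note


def transpose(region, semitones):
    """Shift every pitch in the marked region by a constant interval."""
    return [replace(n, pitch=n.pitch + semitones) for n in region]


def augment(region, factor):
    """Increase every duration in the region by a constant factor."""
    return [replace(n, duration=n.duration * factor) for n in region]


def diminish(region, factor):
    """Decrease every duration in the region by a constant factor."""
    return [replace(n, duration=n.duration / factor) for n in region]


# A two-note region: a quarter-note C followed by an eighth-note D.
region = [Note(60, Fraction(1, 4)), Note(62, Fraction(1, 8))]
up_a_tone = transpose(region, 2)
doubled = augment(region, 2)
restored = diminish(doubled, 2)
```

Exact fractions are used for durations so that augmenting and then diminishing by the same factor returns the original region without rounding error.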

Several very simple CMN systems are available commercially and may be useful
for some purposes. These include Passport Designs' "Soundchaser Notewriter"
[CMJ82], which runs on an Apple II personal computer equipped with a synthesizer
and clavier, and a new standalone device made by Yamaha, the MP-1 "Miniprinter":
a portable electronic clavier with synthesizer and built-in pen plotter that
"prints melody lines on a 2-1/2 inch wide paper roll" [CAE83]. Both of these
systems appear to be limited to single-note-at-a-time music. [CMJ82] also mentions
the "Scorewriter" system for the Con Brio digital synthesizer, which appears to be
more sophisticated, but I have not been able to obtain any details about it.

3.5. HISTORICAL OVERVIEW OF MY SYSTEM

The reader needs to know a few things about how I started working on this
project to fully understand my approach. Before I knew anything about computers, I
was a composer. Like every composer, I found myself spending many hours on the
essentially mechanical and unmusical task of neatly recopying (sometimes with
transposition) scores and parts of my pieces from barely legible pencil
manuscripts. So it is not surprising that when I first learned something about
computers in 1967 I thought, "I could program this machine to be my copyist", and
within a few months set out to do so.[21]

The computer situation at Indiana University was for many years one of unlimited
access to a "maxicomputer" (CDC 6600), with very mediocre timesharing (300 baud),
and almost no interactive graphics. This was fairly typical of large universities
until relatively recently. In any case, the mediocrity of the interactive
facilities forced me to take basically a "batch-processing", i.e., as highly
automated as possible, approach to everything.

One thing I quickly became aware of when I started programming was the enormous
amount of time people at I.U. were wasting duplicating work already done. I saw
two reasons for this: (1) programs written for one machine usually could not
easily be run on another; and (2) programs performing related functions that could
have been used together often had incompatible formats. These observations led me
to design an integrated system of music programs (including analysis and
sound-synthesis components as well as notation) that could easily be used
together. The resulting "Indiana University Computer Music System" is diagrammed
in Fig. 6. Arrows represent the flow of information; names in smaller print
represent interface programs. SMUT is, of course, the music-setting program whose
development was the kernel of my research.

In accordance with my philosophy, and since a translator for MUSTRAN
(described in Sec. 3.2.4) already existed, I decided to use MUSTRAN as an input
language. By calling the program that recognizes MUSTRAN a "translator", I mean
that it translates MUSTRAN from a free-format context-sensitive form relatively
convenient for humans into a fixed-format context-independent form much easier for
another program to handle. However, the format of the information produced by
MUSTRAN is not quite suitable for input to a notation program; besides, MUSTRAN
provides no high-level control features at all (see Secs. 4.3 and 5.3.1, and
[BYRD80] for discussions of such facilities). To translate MUSTRAN output into
SMUT input and facilitate high-level control, I wrote an interface program, called
SMIRK. (The other pathway shown into SMUT, through JANUS, is designed for
compatibility with composing programs, such as MUSC and STOCHOS, and digital
sound-synthesis programs, such as MUSIC V.)

Additional background and discussion of the implications of these basic design
decisions, especially portability, is given in my paper [BYRD80]; however, I would
like to make some additional comments here. SMUT's overall structure was
influenced heavily by two factors: my desire to make it portable, and its batch
orientation. These factors are interrelated: the desire for portability was one
reason for the batch orientation. Another result of the desire for portability is
SMUT's reliance on sequential I/O only, considering that until quite recently no
widely available higher-level language supported direct-access I/O in a reasonably
machine-independent way. This restriction to sequential I/O forced me to find ways
to do the formatting that relied at each step very largely on local information.
(As Gomberg's quotation in Sec. 5.4 makes clear, this is hardly the way a person
formatting music manually works. Here is a case in which, it seems to me, properly
imitating manual techniques would pay off; see Sec. 5.4.) The need to keep
information localized, in
turn, led to SMUT's four-pass structure.

Chapter 4 is devoted to a detailed description of SMUT. However, a summary of
its capabilities comparable to those I have given for other systems should be
helpful. SMUT produces high-, but not engraving-, quality scores with one or two
voices per staff. Aside from integration into the IUCMS, its outstanding features
center on two areas:

(1) Portability, both in terms of CPU and graphic output device. In 1980, I
converted a version of SMUT from the original 60-bit maxicomputer (CDC
6000/CYBER) version to a 32-bit virtual memory superminicomputer (PRIME
50 series); in 1983, Kimball Stickney converted a later version from
maxicomputer to another, similar superminicomputer (DEC's VAX under the VMS
operating system). Each conversion took 20 or 30 hours of work, less than one
percent of the 5000 or more hours involved in developing the original version.
Also, SMUT has been used to drive numerous output devices of all types
(random- and raster-scan, display and hard copy) with essentially no
incremental work per new device, simply because of its reliance on extremely
simple graphic primitives (see Sec. 4.7). See [BYRD80] for more discussion of
portability issues.

(2) Automation. SMUT, with some aid from SMIRK, provides several important 
high-level user-interface features: (a) transposition, i.e., changing pitches by a
constant; (b) automatic clef changes to avoid excessive ledger lines; (c) 
automatic breaking of multibar rests into series of whole-rests; (d) performance 
directions whose printing is conditional on their belonging to the top voice of a 
group; (e) automatic rhythm clarification (see Sec. 4.4.1); and (f) automatic 
beaming, beam positioning, and fractional beam pointing (see Sec. 4.4.2). 
Features (a) through (d) are especially useful because they make it possible, in
most cases, to encode a piece of music in such a way that the user can obtain 
from it either a score or a set of parts, with few or no changes to the data. 
Doing so requires one or more of these features in most cases. Features (b), (e), 
and (f) should be particularly useful to persons who are employing a computer 
as an aid in the composing process, as suggested by (among others) Iannis 
Xenakis [XENA71]; in fact Xenakis' composing program STOCHOS has been 
interfaced to SMUT, as shown in Fig. 6. 
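
Two of the features listed under Automation, (b) and (c), can be sketched briefly. The code below is a hypothetical illustration and not SMUT's implementation, which is described in Chapter 4. It assumes a numbering of diatonic staff positions in which middle C is 0, considers only treble and bass clefs, and uses invented function names.

```python
# Diatonic staff positions: 0 = middle C, 1 = the D above it, -1 = the B below.
# (This numbering is an assumption made for the sketch.)
STAFF_RANGE = {
    "treble": (2, 10),     # bottom line E4 up to top line F5
    "bass": (-10, -2),     # bottom line G2 up to top line A3
}


def ledger_lines(step, clef):
    """Number of ledger lines needed to write a note at this position."""
    low, high = STAFF_RANGE[clef]
    if step > high:
        return (step - high) // 2
    if step < low:
        return (low - step) // 2
    return 0


def choose_clef(steps, current):
    """Feature (b): change clef only if the other clef needs fewer ledger lines."""
    other = "bass" if current == "treble" else "treble"
    worst_current = max(ledger_lines(s, current) for s in steps)
    worst_other = max(ledger_lines(s, other) for s in steps)
    return other if worst_other < worst_current else current


def expand_multibar_rest(bars):
    """Feature (c): a multibar rest in a part becomes whole rests in a score."""
    return ["whole rest"] * bars


low_passage = [-7, -5, -3]          # C3, E3, G3
print(choose_clef(low_passage, "treble"))
print(expand_multibar_rest(3))
```

Keeping the current clef on ties avoids needless clef changes, which is one plausible policy among several.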

NOTES


[1] One difficulty in putting together such a survey is that much (perhaps most)
of the relevant work has been done in non-academic settings and is not readily
available. This is especially true in recent years, as computer music setting has
gained considerable commercial importance. As a result, a large amount of
important work is unpublished. However, I have obtained quite a bit of unpublished
information from the researchers themselves, most of whom were very cooperative.

[2] [MOOR75] describes a precursor, the "Melograph" (not to be confused with the
18th-century melograph I have previously mentioned, which undoubtedly was driven
by a directly-connected instrument). This was a "special-purpose piece of mostly
analog hardware and a chart recording scheme" developed for the UCLA
Ethnomusicology Department; for monophonic music, it could produce a graph of
pitch as a function of time. Neither the 18th-century nor the 20th-century
melograph produced anything like CMN.

[3] In the graphics literature, tablets are usually described as "locators", and
pointing devices are usually described as "picks". In this terminology for
interactive input devices, real picks are rarely used; instead they are usually
simulated, most often by locators. I will generally avoid this terminology because
I feel that, for our purposes, it would cause confusion more than it would help.

[4] Again, an analogy to text processing presents itself: the "on-line character
recognizer". See [NEWM79, p. 2028].

[5] The developers of DARMS argue that using a letter name for the notation of a
pitch implies a low-level music-theoretic decision that should not be part of the
encoding process. The justification for this claim is too technical to discuss
here, although it has to do with the relationship between vertical position and
pitch. The principle is certainly valid, but I find its application here totally
unconvincing. Another argument in favor of the DARMS technique is given by Gomberg
[GOMB75, p. M].

[6] I say "almost certainly not as well" on user-interface grounds. It is very difficult for a system 
in which the user's data is run through several programs in succession to give error messages as 
helpful as those that can easily be given by a single program. 

[7] The meaning here of the phrase "explicitly says" deserves some comment. In a typical system 
of the type I am describing, the keyboard controller periodically sends the computer a string of 
bits each of which represents a key, with "1" bits representing keys that are currently depressed. 
From the viewpoint of a program, this is certainly a more explicit representation of pitch than it 
would get from sampling the acoustic signal generated by a performance. The human viewpoint 
can be quite different, however. Imagine a situation where, let us say, the Duchy of Grand 
Fenwick is at war with the United States and forbids the possession of recordings of the "Star 
Spangled Banner". Would a court be more likely to convict someone who has an ordinary audio 
tape of an organ performance of the piece, or someone who has a magnetic tape containing digital 
samples of the same performance? Very likely the former. Lest this sound too silly, a 
pornography trial was once held that turned on a similar question for videotape. 

[8] Note, incidentally, that this entire discussion is based on the assumption that the performer is 
trying to play in a constant tempo (and, of course, that the computer knows what the tempo is). 
If this assumption is violated, as it nearly always is in ordinary performances (i.e., ones that are 
not for the benefit of computers), the problem is far more difficult. 

[9] It may be objected that voice assignment of a note does not always have notational 
consequences. This is true, but it has such consequences so often that the problem cannot be ignored. 

[10] The crossing of voices also causes serious purely notational (typographic) difficulties, as 
mentioned in Sec. 2.3.6.2. 

[11] The same classification scheme is appropriate for methods of loading the memories of digital 
sequencers, devices that are used in conjunction with electronic music synthesizers to record series 
of events (ordinarily notes) so that they can be recalled and played back with, typically, a single 
keystroke. See the article by Devarahi [DEVA83]. The term "single-step", commonly applied to 
sequencers, is equivalent to my "non-real time". Devarahi says "In single-step loading, each note 
is entered as an isolated event, either from a calculator-type keyboard or from a [clavier]." 

[12] A remarkable book, [CARL78], exists which seems by implication to "give the lie" to my 
statements about the limitations of music typewriters. It contains amazingly complex and subtle 
pictures made on ordinary Roman-character typewriters. However, the book makes it clear that 
the pictures were done with such techniques as rotating the paper in the carriage, varying 
pressure on the keys (of nonelectric machines) to change contrast, doing fractional spacing, typing 
through cut-outs, etc. The enormous amount of work this will take for nontrivial results is 
obvious, so that doing complicated graphics by typewriter is really an example of what has been 
called, with apparent reference to the Turing machine, the "Turing Tarpit": a technique that 
makes everything possible but little or nothing practical. More direct evidence of the utter 
impracticality of doing CMN on conventional typewriters is provided by a statement by one of 
the artists whose work appears in [CARL73], Rochelle Vickey [VICK83], that "I have attempted 
several times to type out music. Of all the feats I've performed with the typewriter, that one 
completely defeated me!" 

[13] One music publisher, Novello, seems to disagree. They have used movable type for a 
substantial amount of music, including such a complex work as Bach's B-Minor Mass, published by 
them in 1907. I do not know if they are still using movable type. 

[14] Knuth gives another brief history of typesetting technology [KNUT79, pp. 15-16] in which 
he argues that 1000 points per inch resolution is, for all practical purposes, perfect: 

I was quite skeptical about digital typography, until I saw an actual sample of what was done on a 
high quality machine and held it under a magnifying glass: It was impossible to tell that the 
letters were generated with a discrete raster! The reason for this is not that our eyes can't 
distinguish more than 1000 points per inch . . . [but] that particles of ink can't distinguish such fine 
details; you can't print the edge of an ink line that zigzags 1000 times on the diagonal of a 
square inch, the ink will round off the edges. 

[15] Among commercially available products, as of early 1983, the highest resolution random-scan 
displays are about 4000 by 4000; the highest resolution raster-scan ones, about 1300 by 1000. For 
hardcopy devices, the figures are invariably given as points over a linear distance, not total 
resolvable points. Random-scan hardcopy devices (and phototypesetters, as I have already said) 
frequently exceed 1000 points per inch, while raster-scan ones rarely exceed 300. Many of these 
numbers could be raised by a factor of about 2, depending on how "commercially available" is 
defined. 

[16] Gray levels are not quite irrelevant, in that, properly used, they can make a display 
appear to have much higher resolution than it actually has. The technique, which is of great 
importance in many types of graphics, is called antialiasing. See [FOLE82], p. 436. Also, color 
played an important role in music notation at various points in history, and it is sometimes used 
now in printed music for didactic purposes, although neither usage could possibly be considered 
part of CMN. For examples of music set in color by my system, see [HOFS82a]. 

[17] This is because most vector devices, hardcopy or display, behave rather idiosyncratically at 
the endpoints of vectors. For example, vector hardcopy devices usually depend on ink which 
takes a measurable distance to start flowing evenly. The inverse operation, converting from 
vector to raster form, is straightforward and, in fact, is generally referred to simply as "scan 
conversion" (see [FOLE82]). 

[18] [RASK80] includes a fairly high-quality sample of music, four measures in length, printed on 
a pen plotter by a program of Raskin's in 1967. He gives no details of the program at all; 
however, according to [GOMB75], Raskin 

produced a rather standard graphics package which produces arbitrary items on a plotter. By 
specifying coordinates, lengths, etc., he produced a drawing which happened to be four measures 
of music, but which might as well have been a portion of a circuit diagram. There was no attempt 
to determine the placement of the components in any automatic way. 

Such a system attacks only Aspect 3 and, therefore, is more comparable to Hiller's music 
typewriter than to Gabura's system or to most of the others discussed here. 

[19] In his dissertation [GOMB75], Gomberg used two pages from the Double Concerto by Elliott 
Carter as examples of great notational complexity. Smith has printed one of these with his 
system, slightly simplified, as a demonstration. 

[20] This direct relationship between use of interactive graphics and machine-dependence is to be 
expected. See [BYRD80]. 

[21] My first estimate of how long it would take to have a usable program was on the order of a 
few months, not H years. Underestimates this extreme of the difficulty of programming CMN 
seem to be rather common. 



4 

How SMUT Works 


4.1. INTRODUCTION 

As I have pointed out (in Sec. 3.1), one can view music setting as having three 
aspects: selecting the symbols to print, positioning them, and actually printing them. 
My system, SMUT, is concerned with all three. It has a number of automatic 
features that affect what symbols it prints. Nearly all of these can be disabled, but a 
few (related to system breaks: see Sec. 4.5) cannot be, since I feel that what they do 
is so deeply embedded in the syntax of CMN that it would not be worth the effort of 
making them selectable. 

Section 2.3 described the basic logic of CMN, intended, as I said there, to impart 
a “reading knowledge” of music notation. It did not give any of the rules for 
selecting and positioning symbols that are part of “writing knowledge”, rules whose 
discovery and implementation has been a major part of my work. As I said in 
Chapter 1, “The closest thing I know of to an explicit statement of the rules of music 
notation is still no more than, to choose a number, 5% explicit (as compared to 
the usual value of, say, 2%).” In this chapter I will first try to give the reader some 
intuition about the problem by pointing out some questions that arise in a fragment 
of simple single-voice music, in a complete page of score by Bach with one voice per 
staff, and in a fragment of music with two voices per staff. As I proceed I will 
compare these rules to those given by other writers. I will next give an overview of the 
organization and operation of SMUT, and then describe some of its more interesting 
features in detail. I will conclude the chapter by “talking through” the beginning of 
the same page of Bach, giving much more detail. 

Note: in this chapter I will criticize my work as I go, using the convention of 
enclosing self-critical comments in bold curly brackets, viz., 

4.2. SOME BASIC QUESTIONS OF POSITIONING AND SPACING 

4.2.1. One Voice 

Fig. 1 is a short fragment of a Stephen Foster song containing a few quarter- 
and eighth-notes, with no accidentals or any other complicating symbols. It 
already brings up a number of questions, however. Should note stems go up or 
down? How long should they be? These questions seem at first to have very 
straightforward answers, and, in fact, they do, in sufficiently simple cases. Ross 
says [ROSS70]: 

A note on or below the second space [from the bottom of the staff] has an 
up-stem on the right side of its notehead. A note on or above the middle staff line 
has a down-stem on the left side of its notehead. 

As for length, Stone says [STON80]: 

stems on single notes should be one octave long unless the note is farther than 
one octave from the middle line of the staff, in which case the stem is lengthened 
to reach the middle line. 

The standard texts, and SMUT, all agree on these simple rules for simple cases.[1] 
What constitutes a non-simple case, then? Stone suggests one: non-single notes, 
i.e., more than one notehead on a single stem. Another case is when two voices 
share a staff; the rules for this case are given in Sec. 2.3.6.2.1. Neither of these 
situations occurs in our little fragment, but a third one does: notes in beamed 
groups with stems. Here, each stem must end at the beam, a constraint that will 
nearly always be inconsistent with the individual notes' normal requirements for 
stems, in terms of direction, length, or both. The rules here are fairly well agreed 
on but far from simple. Ross devotes no less than 20 pages to this question, 
mostly in tables. Gomberg [GOMB75] expends less paper on the point than Ross, 
but probably sheds more light on it than Ross or anyone else has; in any case I 
will not enter the fray here. I will, however, say something about SMUT's 
method in Sec. 4.4.2. 
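
The two simple-case rules quoted above from Ross and Stone can be stated compactly in code. The following is only a sketch, not SMUT's own code: it assumes that vertical positions are counted in half-lines from the bottom staff line (0 = bottom line, 4 = middle line, 8 = top line), and the function names are hypothetical.

```python
MIDDLE_LINE = 4  # half-lines above the bottom staff line (assumed convention)
OCTAVE = 7       # an octave spans seven staff positions (half-lines)


def stem_direction(pos: int) -> str:
    """Ross's rule for a single, unbeamed note at staff position `pos`."""
    # On or below the second space (pos 3): up-stem.
    # On or above the middle line (pos 4): down-stem.
    return "up" if pos < MIDDLE_LINE else "down"


def stem_length(pos: int) -> int:
    """Stone's rule, in half-lines: one octave, or longer if needed to reach the middle line."""
    return max(OCTAVE, abs(pos - MIDDLE_LINE))


for pos in (-6, 0, 3, 4, 8, 14):
    print(pos, stem_direction(pos), stem_length(pos))
```

Beamed groups and multi-note stems, the non-simple cases discussed above, are deliberately outside the scope of this sketch.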

Figure 1. Positioning and spacing: one voice 

A question of a different sort is, how much horizontal space should be inserted 
between symbols? The principle was expounded in Sec. 2.2.3.3, namely that the 
ideal distance between consecutive time-occupying symbols (notes or 
rests) increases with time, but more slowly than linearly. The nonobvious 
formula actually used by SMUT, involving exponentiation and logarithms, is given 
in Sec. 4.6. This ideal spacing is actually realized in Fig. 1, because of its 
simplicity, while it is violated continually in real music, mostly for those short notes 
that need more than the ideal amount of space. This might be because of 
accidentals or dots on the short notes, or it might simply be because the (human!) 
editor did not want to allocate more than the minimum possible space to, say, 
500 32nd-notes in a piece so that a very few 64th-notes could be closer together 
than the 32nds. See the next section and Chapter 2, Fig. 32. 
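
SMUT's own spacing formula is not reproduced in this section, so the following is only an illustration of the general principle (space grows with duration, but more slowly than linearly). The particular functional form, the constants, and the name `ideal_space` are all assumptions made for this sketch, not SMUT's actual values.

```python
import math


def ideal_space(duration: float, ratio: float = 1.5, base: float = 1.0) -> float:
    """Hypothetical ideal space after a note lasting `duration` quarter-notes.

    Each doubling of the duration multiplies the space by `ratio`. Because
    `ratio` is less than 2, the space increases with time but more slowly
    than linearly.
    """
    return base * ratio ** math.log2(duration)


for name, dur in [("16th", 0.25), ("eighth", 0.5), ("quarter", 1.0), ("half", 2.0)]:
    print(f"{name:8s} {ideal_space(dur):.3f}")
```

With these assumed constants a half-note gets 1.5 times the space of a quarter-note rather than twice as much, which is the qualitative behavior the text describes.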

4.2.2. Several Staves, One Voice Per Staff 

A page of music by Bach, the first page of the so-called "Sloth Canon" from 
the Musical Offering, BWV 1079, is given as printed by SMUT in Fig. 2. Several 
points of interest are labelled with numbers (enclosed in circles); I will now 
comment on them, one by one. 

(1) The heads of notes occurring at the same time are (when possible; see Sec. 
2.3.6.2.1) aligned, not the stems or the first components of the note "hunks" 
(Sec. 2.3.6.2.1), which include accidentals before the head and flags and dots 
after it. 

(2) Flags go to the right, whether they are above or below the notehead. 

(3) When a slur[2] extends across a system break, it is replaced with two slurs, 
one at the end of the first system and one at the beginning of the second. 
Notice that the left end of the slur that crosses into the beginning of a 
system cannot have a fixed left x-coordinate because it cannot begin until after 
the key signature, whose width can vary greatly (down to zero). 

(4) See item (5). 

(5) This dotted-eighth-plus-16th group gets its ideal space allocation, since 
nothing in this or the other voices disturbs it. The similar group at (4), on the 
other hand, gets considerably more space in order to accommodate the top 
voice. This clearly shows the violation of ideal spacing I discussed in the 
previous section (Sec. 4.2.1). 

Figure 2. Positioning and spacing: several staves, one voice per staff in the Bach 
Sloth Canon, score (set by SMUT). 

(6) Alignment here is interesting. The D-flat, the fourth note of the measure in 
the middle voice, is positioned further to the right than it would ideally be 
in order to leave room for the lowest voice. The 32nd-rest in the lowest 
voice starts in the middle of the middle voice's dotted eighth, and it is 
positioned accordingly. Notice, however, that no additional space was 
needed for the flat sign in the middle voice, which overlaps the second 
notehead in the lowest voice. This poses no problem for the reader of the music. 

(7) This is another, rather dramatic, demonstration of non-ideal spacing. The 
32nd-notes in the bottom voice are spaced very nonuniformly because of the 
naturals attached to two of them. As a result of the demands of the bottom 
voice, augmented by the two naturals, the 16th-notes in the middle voice are 
much further apart than they "should" be. 

(8) Compare the two ties, which involve notes of identical pitch and even 
duration: one is above the notes while the other is below. This is a consequence 
of the differing stem directions of the eighth-notes in the two situations, 
which difference results in turn from the beamed groups that the eighths are 
involved in. The first beam requires both of its notes to have stems up (see 
Sec. 4.4.2), which is normal for the tied notes, since they are in the second 
space; but the second requires both of its notes to have stems down [3], and 
the usual rule for slur placement is that the slur goes above the notes if one 
or more of the notes it encompasses has a down stem. [4] Also, the left end of 
the second tie had to be moved right from its normal position in order to 
avoid intersecting the note's stem. 

(9) The stems of several of the notes in this beamed group have been extended 
considerably so that none will be too short. 

(10) These two beamed groups are very similar-looking, but their overall pitch 
contours are reflected by the beams' slopes: the one on the first is horizontal 
while that on the second tilts slightly down. 

Two global decisions are: where system breaks should go; and what groups of 
notes should be beamed together. The former is in principle very intuitive. The 
main principle underlying the latter is to show the underlying metric structure of 
the music as clearly as possible. 


4.2.3. Two Voices Per Staff 

Chapter 2, Fig. 45 (Beethoven, Symphony No. 9, III, p. 13S, clarinets) is 
another fragment, but a much more complex one than Fig. 1. It illustrates some 
of the problems that arise in music that has two voices per staff, many of which 
were mentioned in Sec. 2.3.6.2. Note stems must all be pointed up for the upper 
voice and down for the lower; slurs, ornaments, fingerings, groupet accessory 
numerals, etc., must be placed towards the outside of the staff. By far the most 
difficult, hunks need to be moved or even internally modified when either their 
noteheads, accidentals, or dots collide: these occur in the second, third, and fourth 
measures of the figure. Doing this automatically is beyond SMUT; it knows 
nothing of inter-voice collision problems. See the next section and Sec. 5.2.1.5. 

4.3. PRINCIPLES OF SMUT'S OPERATION 

Internally, SMUT consists of an initialization phase plus four “passes”, which 
roughly move down the list of aspects given above, progressing from the abstract 
towards the concrete. One important way in which this is visible is in the coordinate 
systems used, which I will describe when appropriate. 

In the initialization phase, one “layout line” and several “initialization lines” are 
read. The layout line describes the page layout and also gives various global 
parameters for the run, such as whether the run is in “normal” or “edit” mode. In 
normal mode, all four passes are run. The most basic option is whether a score 
or a set of parts is to be produced. Most of the problems involved are 
identical.[5] In edit mode, there are two options: either the first three passes can be 
run to produce intermediate files describing a score or set of parts; or only pass 
IV can be run (with intermediate files from a previous run) to actually print the 
score or parts. These options make it possible to run separate programs between 
passes III and IV for special purposes such as interactive graphic editing or 
combining music with separately formatted text.[6] 

The page layout is specified by five parameters, as shown in Fig. 3: height of 
staff (HS), length of staff (XLN), y-coordinate of the bottom of the top staff 
(YH), minimum y-coordinate for the bottom of the bottom staff (YMlN), and 
vertical distance between staves (YDELT). 
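
To make the roles of the five layout parameters concrete, here is a small sketch that derives the position of each staff on a page from them. It is not SMUT's code: the class and method names are hypothetical, and it assumes that y increases upward and that YDELT is measured from the bottom of one staff to the bottom of the next, neither of which is stated explicitly above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PageLayout:
    hs: float     # HS: height of staff
    xln: float    # XLN: length of staff
    yh: float     # YH: y-coordinate of the bottom of the top staff
    ymin: float   # YMIN: minimum y-coordinate for the bottom of the bottom staff
    ydelt: float  # YDELT: vertical distance between staves

    def staff_bottoms(self) -> List[float]:
        """y-coordinates of the bottom line of every staff that fits on the page."""
        bottoms = []
        y = self.yh
        while y >= self.ymin:      # stop once a staff would fall below YMIN
            bottoms.append(y)
            y -= self.ydelt        # assumed: YDELT is bottom-to-bottom spacing
        return bottoms


layout = PageLayout(hs=0.25, xln=7.0, yh=9.5, ymin=1.0, ydelt=1.5)
print(layout.staff_bottoms())
```

The numeric values are invented for the example and are chosen only so that the arithmetic is easy to follow.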

Most of the initialization lines control features similar to those in typical natural 
language text formatters: titles, running footers, etc. Two, however, are specific 
to music formatting. One controls “grouping of voices”, a specification of a 
hierarchy of the type described in Sec. 2.3.6.2.S, including which voices (if any) 
share staves.[7] Voice grouping determines whether barlines extend across 
staves (they join “related” voices into groups and are broken between 
groups) and how conditional performance directions are handled (see Sec. 3.5 
and Fig. 14). Voices sharing a staff have their stems pointed away from each 
other, slurs, ornaments, and groupet accessory numerals put on the outside, and 
so on. {Problem: SMUT is totally ignorant of the danger of collisions between 
two voices sharing a staff. In particular, when their notes are a second apart or 
in unison, the heads will overlap or be superimposed (see Sec. 2.3.6.2.1), and 
when both have accidentals and are within a sixth, the accidentals may overlap. 
See Sec. 5.2.1 for a lengthy discussion of such collisions.} The other 
music-specific command specifies how many measures are to go in each system of 
music; if present it clears flag SYST to disable automatic placement of system 
breaks (see below). 
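
The two collision conditions named in the self-critical comment above are easy to state as tests on simultaneous notes in two voices that share a staff. The sketch below is not part of SMUT (which, as stated, performs no such check); it assumes positions are counted in staff steps (half-lines), and the function names are hypothetical.

```python
def head_collision(pos_a: int, pos_b: int) -> bool:
    """Noteheads overlap (a second apart) or are superimposed (unison)."""
    return abs(pos_a - pos_b) <= 1


def accidental_collision(pos_a: int, pos_b: int, acc_a: bool, acc_b: bool) -> bool:
    """Accidentals may overlap when both notes have one and lie within a sixth."""
    # A sixth spans five staff steps (unison = 0, second = 1, ..., sixth = 5).
    return acc_a and acc_b and abs(pos_a - pos_b) <= 5


print(head_collision(4, 5))                    # notes a second apart
print(accidental_collision(2, 7, True, True))  # a sixth apart, both with accidentals
```

Detecting a collision is of course the easy part; deciding how to move or modify the hunks is the difficult problem the text refers to.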

Pass I is primarily the symbol-selecting pass. It reads the entire data file, which 
must be arranged as all of voice 1, then all of voice 2, etc. Depending on the 
options enabled by control commands, it performs various “high-level” 
functions, as follows: 

(1) Deciding what accidentals to use, both regular and cautionary, if the flag 
CHROM is on. (Cautionary accidentals are accidentals that are technically 
redundant but are included as reminders to the performer; they are often 
printed in parentheses. For example, if C-sharp appears in the key signature 
and one measure has a C-natural, a C in the next measure without an 
accidental may have a sharp in parentheses in front of it. See Sec. 2.5.) 
{Problem: it is not always possible to determine cautionary accidentals for 
voices sharing a staff by examining the voices independently. For example, if 
a sharp appears on an F in one voice on a staff, a following F-natural in the 
other voice should always have an explicit natural. See [STON80], p. 156.} 
SMUT allows pitch input in two forms, selected by CHROM. If CHROM is 
on, pitch is described “chromatically”: a statement of the actual pitch is 
made, not of the notation, and SMUT chooses the appropriate enharmonic 
representation. The CHROM pitch descriptor is simply an integer giving the 
number of semitones above the bottom note on the piano, A0 in ASA 
notation (Sec. 2.3.2). So C4 (middle C) is 39. A note with pitch descriptor 
40, then, might be written as a C with a sharp in front of it, a D with a flat 
in front of it, or (depending on the key signature and previous accidentals in 
the measure and assuming that no cautionary accidental is needed) as just a 
C or D with no accidental. If CHROM is off (the usual choice), SMUT 
expects an explicit description of the notation. 

(2) Adding barlines, under control of the flag BAR. If BAR is on, SMUT sums 
the durations of notes and rests as it goes, continuously compares the elapsed 
duration in the current measure to the proper measure duration (as specified 
by the current time signature), and generates barlines when needed. 

(3) Rhythmically “clarifying” notes and rests, if the flag DESYNC (short for 
"desyncopation") is on. That is, any rest that makes the rhythm hard to 
read by obscuring the beat is broken into a series of consecutive rests, and 
any note that makes the rhythm hard to read is broken into a series of 
consecutive tied notes (which I will call “subnotes”). The algorithm for this will 
be discussed in detail in Sec. 4.4.1. [8] If a note that has been broken down 
had a glissando on it, SMUT generates appropriate intermediate pitches for 
the subnotes; otherwise all of the subnotes have the same pitch. 
Note-related symbols are moved to the appropriate subnote (usually the first). 

(4) Deciding what notes to beam together, under the control of the time 
signature and the flag BEAM, and which way to point fractional beams (with 
algorithms discussed in Sec. 4.4.2). 

(5) Generating whole rests (separated by barlines, of course) from multibar rests 
if SMUT is making a score. (Handling of multibar rests is the only situation 
in which Pass I cares whether it is doing a score or a set of parts.) 

(6) Deciding when to change clef, if specified by the flag CLEF. When the user 
sets CLEF, she or he must specify which clefs the program may choose from; 
this is necessary since, as pointed out in Sec. 2.3.2, CMN employs only a 
subset of the existing clefs for any given instrument. Of course, changing clefs 
affects the vertical position of all notes, as well as of many other symbols 
(slurs, groupet accessory numerals, etc.), following. A good algorithm for 
deciding when to change clefs would be fairly complex, requiring, at the least, 
some lookahead. A really good algorithm would also need considerable 
knowledge of the semantics of music; see Sec. 2.5. SMUT is very 
simpleminded about this, however; with some minor restrictions, it simply changes 
clef whenever it encounters a note that would need more than four ledger 
lines in the current clef and changing to one of the allowed clefs would reduce 
the number of ledger lines. 

(7) Handling SMUT’s very simple control mechanisms, namely loops with fixed 
iteration counts and conditionals. 
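
Two of the functions listed above, items (1) and (2), are simple enough to illustrate directly. The sketch below is not SMUT's code. `chrom_spellings` only enumerates candidate notations for a CHROM pitch descriptor (choosing among them by key signature and earlier accidentals is left out), and `add_barlines` follows the description of BAR under the added assumption that no note or rest crosses a barline. All names are hypothetical.

```python
from fractions import Fraction

# Sharp and flat spellings for each pitch class (C = 0). Spellings such as
# B-sharp or C-flat are omitted to keep the sketch short.
SPELLINGS = {
    0: ["C"], 1: ["C#", "Db"], 2: ["D"], 3: ["D#", "Eb"], 4: ["E"], 5: ["F"],
    6: ["F#", "Gb"], 7: ["G"], 8: ["G#", "Ab"], 9: ["A"], 10: ["A#", "Bb"], 11: ["B"],
}


def chrom_spellings(descriptor):
    """Candidate notations for a CHROM pitch descriptor (semitones above A0)."""
    # A0 lies 9 semitones above C0, so shift before splitting into octave and pitch class.
    octave, pitch_class = divmod(descriptor + 9, 12)
    return [f"{name}{octave}" for name in SPELLINGS[pitch_class]]


def add_barlines(durations, measure_duration):
    """Copy `durations` (in whole notes), inserting "|" each time a measure fills up."""
    out, elapsed = [], Fraction(0)
    for d in durations:
        out.append(d)
        elapsed += d
        if elapsed > measure_duration:
            raise ValueError("a note or rest crosses a barline")
        if elapsed == measure_duration:
            out.append("|")
            elapsed = Fraction(0)
    return out


print(chrom_spellings(39))  # middle C
print(chrom_spellings(40))  # C-sharp or D-flat
q = Fraction(1, 4)
print(add_barlines([q, q, q, q, 2 * q, 2 * q], Fraction(4, 4)))
```

Exact fractions are used for durations so that the comparison with the measure duration is not disturbed by rounding.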

Pass I also makes some positioning decisions, such as: whether beams go above 
or below notes, and approximately where; whether slurs go above or below 
notes; groupet accessory numeral vertical positions; and stem directions (up or 
down) and stem lengths. (For beamed notes, stem lengths assigned here are 
approximate, since they depend on the position of the beam, which cannot be 
decided exactly until Pass III: see Sec. 4.6.) All of these decisions are affected by 
the SHARE flag. This flag is automatically set at the beginning of each voice to 
indicate whether the voice is the upper of a shared staff, the lower of a shared 
staff, or has a staff all to itself, but the user can (for special purposes) explicitly 
set it to any of these values at any time. Pass I writes its output on scratch file 
1, still with all the data for each individual voice together. The coordinate 
system used here is, for the time/x axis, "attack time" (see Sec. 4.6); for the pitch/y 
axis, an integer number of “half-lines" relative to the bottom line of the staff (so, 
for example, a note on the top line of the staff has y-coordinate 8). 

If a score is requested, Pass II sorts the Pass I output file by measure onto scratch 
file 2. In order to align everything properly in the score, SMUT needs to 
examine everything happening in measure 1 across all voices, then everything in 
measure 2 across all voices, etc. {Problem: This method will not work for the 
significant body of music in which barlines do not always coincide, i.e., measures 
do not always begin or end together. However, this cannot occur in CMN, and 
being able to slice the music into independent sections considerably simplifies 
matters. See [GOMB75] for an approach to music setting that does not assume 
coinciding barlines.} Of course, this is much more easily done if the input is 
sorted by measure, which is the raison d'etre of Pass II. If a set of parts is 
requested, Pass II does nothing except output some statistics. 
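
The reordering performed by Pass II for a score amounts to a stable sort keyed on measure number. The sketch below illustrates this with a hypothetical record layout (voice, measure, symbol); SMUT's actual file format is not described here and is not being reproduced.

```python
from typing import NamedTuple


class Record(NamedTuple):
    voice: int
    measure: int
    symbol: str


def sort_by_measure(records):
    """Reorder voice-by-voice records so that each measure's data is contiguous."""
    # sorted() is stable, so symbols keep their original order within a voice and measure.
    return sorted(records, key=lambda r: (r.measure, r.voice))


pass1_output = [
    Record(1, 1, "quarter C4"), Record(1, 1, "quarter D4"), Record(1, 2, "half E4"),
    Record(2, 1, "half G3"), Record(2, 2, "half C3"),
]
for rec in sort_by_measure(pass1_output):
    print(rec)
```

After sorting, everything in measure 1 (voice 1, then voice 2) precedes everything in measure 2, which is the order the later alignment step needs.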

Pass III is concerned mostly with positioning. It reads scratch file 2, one bar of 
score or of a part at a time, and assigns final y-coordinates to several symbols 
that could not be fully handled by Pass I, for example, beams. More 
importantly, in its “justification” subroutine JUSTIF, Pass III uses the time 
information it receives to assign temporary (because right-justification has not yet been 
done) x-coordinates to all symbols, i.e., it does global punctuation. If SYST is on, 
it decides as it goes when the current system is full and a new one must be 
begun. The process is quite complex; it is described in detail in Sec. 4.5. Pass 
III writes records, each of which describes one musical symbol, onto scratch file 
3, and control information (y-position offsets for the system, the number 
of records for the system on file 3, etc.) onto scratch file 4. The coordinate 
system used now has floating-point position values for both x and y, normalized 
for a staff of height 1 with its lower left-hand corner at (0,0). 

Pass IV reads scratch files 3 and 4 and actually does all of the printing. As it goes, 
it multiplies the temporary x-coordinate of each symbol by an appropriate 
constant so that the system is stretched to just the right length (as given on the 
layout line), unless right-justification has been turned off either globally or for 
this system alone. It further scales y-coordinates by the actual height of the staff 
desired, and offsets y-coordinates by the position of the bottom of the staff, to 
produce coordinates in the graphic output device's coordinate system. Pass IV 
achieves device independence by using only a very few vector-drawing primitives. 
See Sec. 4.7 for more details. 
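
The coordinate conversion in Pass IV can be pictured as a simple linear map from normalized staff coordinates to device coordinates. The sketch below is an illustration under stated assumptions, not SMUT's code: the function and parameter names are hypothetical, the stretch constant is taken to be the staff length divided by the unjustified width of the system, and with justification off x is assumed to be scaled by the staff height like y.

```python
import math


def to_device(x, y, natural_width, xln, hs, staff_bottom, staff_left=0.0, justify=True):
    """Map a normalized (x, y) position (staff height 1, origin at the staff's
    lower left-hand corner) to device coordinates."""
    # Stretch x so the system's unjustified width fills the staff length XLN;
    # without justification, fall back to uniform scaling by the staff height (assumed).
    x_scale = xln / natural_width if justify else hs
    return (staff_left + x * x_scale, staff_bottom + y * hs)


# A symbol halfway along a system whose unjustified width is 20 staff heights,
# sitting on the top line of a staff 0.25 units high whose bottom is at y = 9.5.
print(to_device(10.0, 1.0, natural_width=20.0, xln=7.0, hs=0.25, staff_bottom=9.5))
assert math.isclose(to_device(10.0, 1.0, 20.0, 7.0, 0.25, 9.5)[0], 3.5)
```

Keeping all device-specific arithmetic in one small function like this is in the spirit of the device independence mentioned above.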


4.4. PASS I: SINGLE-VOICE PROCESSING 

I will concentrate here on Pass I's two fancy features relating to rhythm 
treatment. N.B. These two sections involve many technicalities of a musical nature that 
are not important to an understanding of the remainder of this dissertation. They 
do, however, go to show how much nonobvious complexity exists in music notation. 
The reader may wish simply to look at the examples to see what I am driving at. 


4.4.1. Rhythm Understanding: Clarification to Insure Readability 

The difference between technically correct rhythm notation and readable 
rhythm notation is not well known, although most musicians know that there is a 
difference. One important aspect of rhythm notation has to do with not 
“obscuring the beat”, i.e., making sure that the underlying rhythmic structure of the 
meter is always recognizable. This requires in some cases writing what could be a 
single rest or note instead as two or more rests or tied notes: see Fig. 4. 
Curiously, the notation books make only a few perfunctory comments on this 
subject.[9] The clearest statement of the principles I have seen is in an elementary 
theory text [WINO79, p. 114]: 

There are two guiding principles for all rhythmic notation: 1. The notation 
should show as clearly as necessary the underlying metric organization. 2. The 
notation should be as concise as possible. 

While perfectly correct, this statement is far from being explicit enough for 
implementation on a computer (and, to my knowledge, nothing more definite has ever 
been published). [WINO79] goes on to describe several ways to notate a single 
rhythm, ranging from abysmal to excellent; for the (human) reader, these do 
indeed make the idea much clearer. But, considering the present limitations of 
artificial intelligence, examples are not of much help to a computer. It is 
certainly not immediately obvious from any set of examples how the rules should be 
formalized. Consider the patterns to the left of the arrows in Figs. 4a, b, d, e, f, 
and g: these need to be rewritten as shown on the right of the respective arrows. 
(I will explain the numbers below the notes later.) On the other hand, Fig. 4c, 
which looks quite similar to 4a and <•, is all right as it is. A first guess at the rule 
might be “subdivide a note or rest at the first internal point with rhythmic 
strength greater than the attack point." (The term “rhythmic strength" was 
introduced in Sec. 2.3.3.2; we will shortly give it a more formal definition.) This 
is certainly too strong (it fails on Fig. 4c and, depending on whether or not it is 
applied recursively, on either i or f) but is on the right track. 

What constitutes clear, concise rhythmic notation? For one thing, it is obvious 
that no note or rest should ever extend into the beginning or across the end of a 
groupet: having a quarter-note partly inside a triplet and partly outside might be 
concise, but hardly clear. But to answer the question completely, we need to 
know what the normal rhythmic structure of a measure is in whatever meter we 
are concerned with. The discussion of meters in Sec. 2.3.3.2 gives us some of this 
structure, including the term “rhythmic strength"; let us continue the analysis 
with an example along the lines of Chapter 2, Table 1. In 4/4 time, 

Figure 4. Rhythm clarification 

We now have an implied hierarchy of rhythmic strength below the beat, but we 
also need a hierarchy of beats within the measure. [10] This is not hard. Firstly, 
the downbeat (the first beat of the measure) is always the strongest beat by far. 
In meters with 3 beats per measure, the second and third are equal; with 4 per 
measure, the second and fourth are equal but weaker than the third. Continuing 
up through 8 beats per measure, we might arrive at the following table: 

beats per measure    strength pattern (0 = weakest beat)

        1            9
        2            90
        3            900
        4            9010
        5            90100 or 90010
        6            900100
        7            9010100 or 9010010 or 9001010
        8            90102010

Evidently, two-part division of each level is predominant, with three-part
division used where unavoidable. The “9” used above for the downbeat is not at all
sacrosanct, but could be replaced by any number larger than about 4. Less
arbitrarily, I have used 0 for the weakest beat, with consecutive positive integers for
successively stronger levels above it.[11] These conventions are important in the
algorithm I am about to present, and can easily be extended to include the
below-the-beat hierarchy we have already discussed. We can simply use consecutive
negative integers starting with -1 for successively weaker levels below the
beat. The complete pattern down to two levels below the beat in 4/4 time is,
then:

[Notation: one 4/4 measure of sixteenth notes, with these strength labels beneath]

9 -2 -1 -2 0 -2 -1 -2 1 -2 -1 -2 0 -2 -1 -2

For completeness, we need one more thing: a way of assigning rhythmic
strength to temporal points within groupets that do not coincide with any time
that can be expressed without groupets.[12] This would not be too hard to do
along the lines we've been following. In effect, groupets set up a temporary
“metric environment”. However, I will oversimplify here (as SMUT does) and
assign to all such points a large negative value, say -9. This will lead to incorrect
results only in a few very complex situations (but see Sec. 4.4.2).

Matters are slightly more difficult in compound meter. (The rule SMUT uses
for classifying meters as simple or compound was given in Section 2.3.3.2.1.)
Above the level of the beat, the situation is identical, but below the beat,
compound meters have two peculiarities. One is suggested by the non-parallelism of
Figs. 4c and 4e. Also, consider the rewriting necessary in Fig. 5a, whose simple
meter analog (Fig. 5b) requires no rewriting. A little thought shows that the
difference amounts to a much stronger need in compound meter, as compared to
simple meter, to divide at the level of the beat and at the first level below the
beat. Note that nothing comparable shows up at any lower level; consider for
example Fig. 6. So, compound meters will largely be taken care of simply by
skipping -1 when assigning strengths, that is, the level below the beat will be -2.
The complete pattern down to two levels below the beat in 6/8 meter is therefore

[Notation: one 6/8 measure of sixteenth notes, with these strength labels beneath]

9 -3 -2 -3 -2 -3 0 -3 -2 -3 -2 -3

(The numbers beneath the notes in the figures are, of course, rhythmic strength 
labels.) 
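
The labelling scheme can be made concrete with a short sketch. The following Python is an illustration only, not SMUT's code: the function name, the `levels` parameter, and the choice of the first alternative for 5 and 7 beats are assumptions made here. It builds the strength labels for one measure from the beat table above, using -1, -2, ... below the beat in simple meter and skipping -1 in compound meter.

```python
# Beat-level strength patterns from the table above. For 5 and 7 beats the
# table offers alternatives; picking the first one is an arbitrary assumption.
BEAT_STRENGTHS = {
    1: [9],
    2: [9, 0],
    3: [9, 0, 0],
    4: [9, 0, 1, 0],
    5: [9, 0, 1, 0, 0],
    6: [9, 0, 0, 1, 0, 0],
    7: [9, 0, 1, 0, 1, 0, 0],
    8: [9, 0, 1, 0, 2, 0, 1, 0],
}


def strength_pattern(beats_per_measure, levels=2, compound=False):
    """Return strength labels for one measure, `levels` (>= 1) levels below the beat.

    Simple meter: the beat divides in two at every level (labels -1, -2, ...).
    Compound meter: the beat divides in three first, then in two, and the
    label -1 is skipped, so the first level below the beat is -2.
    """
    first_div = 3 if compound else 2
    slots_per_beat = first_div * 2 ** (levels - 1)
    pattern = []
    for beat_strength in BEAT_STRENGTHS[beats_per_measure]:
        pattern.append(beat_strength)
        for k in range(1, slots_per_beat):
            # Find the shallowest level whose grid contains slot k.
            step = slots_per_beat // first_div
            level = 1
            while k % step != 0:
                step //= 2
                level += 1
            pattern.append(-(level + 1) if compound else -level)
    return pattern


four_four = strength_pattern(4)                 # sixteenth-note grid in 4/4
six_eight = strength_pattern(2, compound=True)  # sixteenth-note grid in 6/8
```

With these assumptions, `four_four` and `six_eight` reproduce the two label rows quoted in the text for 4/4 and for a compound duple meter.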

We are finally ready to discuss the rhythm-clarification algorithm. It is as fol- 
lows: 

(1) Divide at any point >1 stronger than the attack.

(2) If there is more than one point exactly 1 stronger than the attack, divide at
the first such point. (These two steps correctly divide Figs. 4a, b, and e, but
leave c, d, f, and g alone.)

(3) If the note or rest crosses any point stronger than the attack and ends at a
point weaker than the attack, divide at the occurrence of higher strength
(only one is possible, because of rules 1 and 2). (This takes care of Fig. 4d.)

(4) If the note or rest has now been divided, its first piece is now satisfactory;
apply the procedure recursively to its second piece. (This takes care of Fig.
4f; c and g are still unaffected.)

Figure 6. Low levels in compound meter
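
One possible reading of steps 1 through 4 is sketched below in Python. This is not SMUT's code. Several things are assumptions made for the illustration: time is measured in sixteenth-note slots within a single 4/4 measure, step 1 is taken to divide at the first qualifying point (the recursion of step 4 then re-examines the remainder), and the end of the measure is given the strength of the next downbeat. The function and variable names are hypothetical.

```python
# Strength labels for a 4/4 measure on a sixteenth-note grid, as given above.
STRENGTH_4_4 = [9, -2, -1, -2, 0, -2, -1, -2, 1, -2, -1, -2, 0, -2, -1, -2]


def clarify(start, end, strength=STRENGTH_4_4):
    """Split the note or rest occupying slots [start, end) into pieces.

    Returns a list of (start, end) pairs; pieces of a note would be tied.
    """
    n = len(strength)
    attack = strength[start % n]
    interior = range(start + 1, end)

    split = None
    # Step 1: a point more than 1 stronger than the attack (first one assumed).
    much_stronger = [p for p in interior if strength[p % n] > attack + 1]
    if much_stronger:
        split = much_stronger[0]
    else:
        one_stronger = [p for p in interior if strength[p % n] == attack + 1]
        if len(one_stronger) > 1:
            # Step 2: several points exactly 1 stronger; divide at the first.
            split = one_stronger[0]
        elif one_stronger and strength[end % n] < attack:
            # Step 3: crosses a stronger point and ends on a weaker one.
            split = one_stronger[0]

    if split is None:
        return [(start, end)]
    # Step 4: the first piece is accepted; recurse on the second piece.
    return [(start, split)] + clarify(split, end, strength)


# A dotted quarter starting on beat 2 is split at beat 3 (step 3).
print(clarify(4, 10))   # [(4, 8), (8, 10)]
# A half note starting on beat 2 is left alone.
print(clarify(4, 12))   # [(4, 12)]
```

The sketch stops at the four basic steps; it does not include the later post-processing for non-notatable durations, rule 3a for compound meters, or any special treatment of rests.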

N.B. It is not obvious that this algorithm always generates notatable
durations, i.e., durations that can be represented by a single note rather than several
notes tied together. (It will be recalled that possible single note durations are of
the form $\sum_{k=0}^{n} d \times 2^{-k}$, where d is a power of 2 and n<4 (or whatever
maximum number of augmentation dots one chooses). See Sec. 2.3.3.1.) In fact, the
algorithm does not always generate notatable durations, although problem cases are
quite rare; one is given in Fig. 7. When it generates a non-notatable duration,
SMUT recursively divides it at the longest integral number of beats of duration
shorter than its full duration, unless this would be zero, in which case it divides
after the longest undotted duration possible. Some thought shows that this
added step always generates notatable durations. In Fig. 7, the basic algorithm
generates a duration of five 16ths, which is not notatable. The new step correctly
divides that duration into two eighths (which get fused into one quarter) and a
16th.
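
The notatability condition is easy to test mechanically. The sketch below (an illustration with hypothetical names, not SMUT's code) expresses durations as fractions of a whole note and uses the closed form of the sum above, d(2 - 2^-n), for n = 0 up to a chosen maximum number of dots.

```python
from fractions import Fraction


def is_power_of_two(q):
    """True if q is 2**k for some integer k (positive, zero, or negative)."""
    q = Fraction(q)
    if q <= 0:
        return False
    num, den = q.numerator, q.denominator
    return (num == 1 or den == 1) and num & (num - 1) == 0 and den & (den - 1) == 0


def is_notatable(duration, max_dots=3):
    """True if `duration` (in whole notes) fits a single note with <= max_dots dots.

    A note of undotted value d with n dots lasts d * (2 - 2**-n), so the
    duration is notatable if dividing by that factor leaves a power of 2.
    """
    duration = Fraction(duration)
    for n in range(max_dots + 1):
        factor = Fraction(2 ** (n + 1) - 1, 2 ** n)
        if is_power_of_two(duration / factor):
            return True
    return False


print(is_notatable(Fraction(3, 8)))    # dotted quarter: True
print(is_notatable(Fraction(5, 16)))   # five 16ths: False
```

The second call corresponds to the problem case mentioned for Fig. 7: five 16ths cannot be written as a single note, so a further division is needed.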

With the above “post-processing”, so far as I know, the rhythm-clarification
algorithm works correctly in all simple meter cases for notes. Two problem areas
remain, namely compound meters and non-notes, i.e., rests. The first of these is
exemplified by Fig. 4g. Adding a step to the algorithm between steps 3 and 4 will
handle such cases:

(3a) In compound meters, divide multiply dotted notes that are at or above the
beat level after the first note.

I have not implemented this rule, however.

The second problem area involves treatment of rests. It is clear that the rules
for rests are somewhat different from those for notes, both in simple and in
compound meters; see Fig. 8 for examples. The differences do not appear hard to
formalize, but again I have not done so.[13]

Fig. 9 gives “before” and “after” pictures, showing the results when the above
algorithm was applied to a variety of rhythm patterns from Read’s book on
rhythm notation [READ78]. I think that most musicians would agree that the
program behaved correctly everywhere except for measures 2 and 3 of Ex. 2-4A
(Read’s example number); there the program actually made things worse. These
two rhythms probably cannot be handled correctly on a note-by-note basis:
instead, the patterns need to be looked at as a whole.[14]

4.4.2. Rhythm Understanding: Beams and Fractional Beams 

Another aspect of rhythm understanding that is important in a music-formatting
program is deciding what notes to beam, where to put the beams
and (least often considered) which way to point fractional beams. SMUT's
beaming is controlled at a high level by a variable, BEAM, that affects both the
extent of beamable areas and the "aggressiveness" of the beaming. Extent of 0
means each beamable area extends for one beat (as determined from the time
signature). Extent of 1 means beamable areas extend for one unit of rhythmic
strength one level above the beat; extent of 2 means for one unit of rhythmic strength
two levels above the beat (almost always the whole measure). For example, in
5/4 time, extent of 0 means beamable areas have one quarter-note duration, of 1
means they have alternate half-note and dotted-half durations[15], and of 2 means
they cover the whole measure. Regardless of extent, aggressiveness may be set to
"aggressive" (beams within an area are broken only when SMUT’s simple-minded
tests suggest that a collision (see Sec. 5.2.1) with an intervening rest, clef, etc., is
otherwise very likely), "timid" (beams within an area are broken by any occurrence of a
symbol that might cause a collision), or "none" (no beams at all).

I have already commented on the standard texts’ lack of explicitness about
fractional beam direction (see Sec. 2.3.3.2.2). Fig. 10 gives some examples. The
basic idea is the same as that of rhythm clarification: fractional beams should be
pointed so as to clarify the underlying metric organization of the music. An
obvious first guess at a rule is “if a fractional beam is on the first note of a beamed
group it points to the right, otherwise it points to the left.”[16] Once again the
first approximation is not very good: this rule works on Figs. 10a, c, f, and g, but
not b, d, or e. But with the aid of the rhythmic strength rating system we already
have, pointing fractional beams correctly is not difficult.

Figure 10. Fractional beam pointing

First, observe that if there is a rest on one side, the beam must point in the
other direction, so that we need only consider the case where both adjacent items
are notes. Also, the fractional beam cannot point outside of any beamed group it
is part of, so if it is on the first or last note of a beamed group it must point to
the right or to the left, respectively. (These rules take care of Figs. 10a and c.)
For cases that are still unresolved, let $s_1$ be the rhythmic strength of the note that
has the fractional beam and $s_0$ and $s_2$ be the strengths of the preceding and
following notes, respectively. Similarly, let $t_0$, $t_1$, and $t_2$ be the attack times of the
three notes. Then the algorithm I have implemented is as follows:

(1) Set $s = \min(\max(s_0, s_1), \max(s_1, s_2))$.

(2) Find the first point of strength at least $s$ before $t_1$; the first such point at or
after $t_1$; and the second such point at or after $t_1$. These three points divide
time into four intervals, each of which we take as closed on the left and open
on the right in order to make a partition. We call these intervals, from left
to right, 0, 1, 2, and 3.

(3a) If $t_0$ and $t_1$ are in interval 1 and $t_2$ is in interval 2, point the beam to the left.
If $t_0$ is in interval 1 and $t_1$ and $t_2$ are in interval 2, point the beam to the
right. (This correctly handles Figs. 10b, d, e, and f.)

(3b) If $t_0$ is in interval 0, $t_1$ is in interval 1, and $t_2$ is in interval 2, point the beam
to the left. (This handles the only remaining example, Fig. 10g.)

(4) If none of the conditions in steps 3a and 3b are satisfied, increment $s$ by 1
and go to step 2. (There appears to be a danger of an infinite loop here. To
detect it, the algorithm actually checks here for $s > 2$ and, if so, quits with an
error message; but, to my knowledge, this has never occurred.)

Step 3b is rather lacking in intuitiveness. It does, however, handle every case I
know of that 3a fails on. {Problem: SMUT’s failure to consider groupets in
assigning rhythmic strength is more likely to cause mistakes here than in rhythm
clarification.}
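
The fractional-beam procedure can be sketched as follows. This Python is an illustration under stated assumptions, not SMUT's code: attack times are sixteenth-note slots in 4/4, the three boundary points of step 2 are all located relative to the attack time of the note carrying the fractional beam, step 3b is read as "the three notes fall in intervals 0, 1, and 2", and groupets are ignored. All names are hypothetical.

```python
# Strength labels for a 4/4 measure on a sixteenth-note grid (repeats per measure).
STRENGTH_4_4 = [9, -2, -1, -2, 0, -2, -1, -2, 1, -2, -1, -2, 0, -2, -1, -2]


def interval_index(t, bounds):
    """Index (0..3) of the left-closed, right-open interval containing t."""
    return sum(t >= b for b in bounds)


def fractional_beam_direction(t0, t1, t2, strength=STRENGTH_4_4):
    """Direction ('left' or 'right') for a fractional beam on the note at t1.

    t0 and t2 are the attack times of the neighbouring notes; both neighbours
    are assumed to be notes inside the same beamed group.
    """
    n = len(strength)

    def st(t):
        return strength[t % n]

    # Step 1.
    s = min(max(st(t0), st(t1)), max(st(t1), st(t2)))
    while True:
        # Step 2: nearest point of strength >= s before t1, then the first
        # and second such points at or after t1.
        b0 = t1 - 1
        while st(b0) < s:
            b0 -= 1
        b1 = t1
        while st(b1) < s:
            b1 += 1
        b2 = b1 + 1
        while st(b2) < s:
            b2 += 1
        where = tuple(interval_index(t, (b0, b1, b2)) for t in (t0, t1, t2))
        # Steps 3a and 3b.
        if where == (1, 1, 2):
            return "left"
        if where == (1, 2, 2):
            return "right"
        if where == (0, 1, 2):
            return "left"
        # Step 4: raise the threshold, with the loop guard described above.
        s += 1
        if s > 2:
            raise ValueError("fractional beam direction not resolved")


# Dotted eighth on beat 1, sixteenth, then a note on beat 2: beam points left.
print(fractional_beam_direction(0, 3, 4))   # left
# Eighth, then a sixteenth on beat 2 followed by a dotted eighth: points right.
print(fractional_beam_direction(2, 4, 5))   # right
```

Both examples agree with ordinary practice: the sixteenth is attached visually to the note with which it shares a beat.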

Once it has been decided which notes are to be beamed and which direction
fractional beams will point, what remains is to decide whether to position the
beams above or below the notes, and then to assign exact coordinates to the
beginnings and ends of the beams. SMUT does this totally automatically in all
cases. Pass I decides whether the beams go above or below the notes, based on
the average note vertical position and the most extreme note vertical position.
Basically, the algorithm attempts to have the stems on most of the notes and on
the most extreme note point in the correct direction.[17] It also assigns tentative
y-coordinates to the ends of the beams; final y-coordinates are not assigned until
Pass III (see Sec. 4.6).

4.5. THE LAYOUT TASKS AND THEIR ORDERING 

This section is based on an identically-titled section of Gomberg's dissertation
[GOMB75, pp. 24-30]. An unusually thorough treatment of the analogous problems
in text setting has been given by Knuth and Plass [KNUT81]; I will refer to that
paper repeatedly.

Table 1 contrasts the sequence of basic layout tasks as performed by an engraver
or other person formatting music manually, with those performed by SMUT. Two of
the terms used here are not well-known except to music engravers. Casting off deals
with an entire movement and consists of deciding how many measures (not
necessarily an integer: see note 23) will go in each system. This determining of
“system breaks” is closely analogous to the determining of line breaks in text [KNUT81].
Punctuation is the process of determining the exact horizontal position of every
symbol in the score; it involves, among other things, justification of the kind commonly
done with text.[18] Referring to text setting, [KNUT81] points out that the term
“justification” is now often used for the process of deciding where line breaks in a
section of text should go, while formerly it meant the process of adjusting the spacing
of a line (after the breaks have been decided) to produce a desired length; this
ambiguity causes much confusion.[19]

TABLE 1

      Manual                              SMUT

1     Casting-off                  1      Beams, stems (begin)
2     Vertically position staff    2 III  Global punctuation
3 W   Punctuation                  3      Casting-off
4     Beams, stems                 4 IV   Local punctuation
5     Ties, slurs                  5      Beams, stems (complete)
                                   6      Ties, slurs
                                   7      Vertically position staff

Codes: W: start writing (or engraving); III: beginning of Pass III; IV: beginning of Pass IV.

One problem with the manual process is intuitively obvious: at least some
aspects of punctuation should be done before casting off. How can one decide how
many measures to put in a system until one knows where the last symbol in each
goes? However, without spending a great deal of time making a “practice run”, the
manual formatter can do punctuation only while writing the finished score, and so
she/he is forced to do casting-off first. What happens in practice is that the person
does rough punctuation in her or his head while casting off, and is liable to make
serious mistakes as a result. This is one of many instances in many domains where
computers have an advantage over people because they can do (in [KNUT81]’s term)
“late binding”. A computer, of course, can do punctuation perfectly “in its head”;
SMUT indeed does some of the punctuation (enough to know exactly how much
room each measure will need) before casting off. However, it is more convenient to
do some of the work later, so SMUT divides punctuation into two parts. (Gomberg
does the same thing.) The first step in the whole process is what Gomberg calls global
punctuation: assigning horizontal positions to symbols as if the entire movement of
the score were to be printed on a scroll, i.e., a page of arbitrary width. Casting-off
for the actual page width is then performed. Then, in Gomberg's words,

With the page boundaries decided, a final local punctuation must be performed
which is a fairly straightforward (nearly) linear expansion or contraction of the
exist[ing] punctuation to fit the desired page width.

An even more dramatic difference between manual and automatic techniques has to
do with the moment at which the vertical position of the staff on the page is decided.
This is again something the manual formatter must do quite early simply so that
she/he can start putting ink on paper. A program can (and SMUT does) postpone
this till the very end.

4.6. PASS III: GLOBAL PUNCTUATION AND CASTING OFF

On each call, Pass III's subroutine JUSTIF globally punctuates one measure of the
score, finalizes beam and slur vertical positions, and optionally determines system
breaks, i.e., casts off.

Gomberg draws a distinction between symbols that require horizontal space and
symbols that

are forced to find their space from that which is left over by those that require
it . . . Tempo indications, other performance directions, slurs and ties are examples of
elements for which space is found after punctuation is completed.

I will term Gomberg's dichotomy that of independent and dependent x-coordinates.
Symbols with dependent x-coordinates are “parasitic” on those with independent
coordinates as, for example, the position of a slur is dependent on a pair of notes, one
of which determines its left end and one its right end.

This is a good start; however, we need further to subdivide the independent
symbols before we can discuss a punctuation algorithm. The need to distinguish a third
type of symbol is illustrated by Fig. 11. In our terminology, clefs are certainly
independent symbols: they require horizontal space of their own.[20] Nonetheless, the
treble clef in the middle of the first measure on the upper staff shows graphic
characteristics distinctly different from notes, rests, etc.: this clef does not have any
required horizontal position with respect to the second, third, and fourth eighth-notes
in the other staff. Instead, it is simply right-justified against the next independent
item on its own staff. Let us say, then, that independent symbols may be either
totally ordered or partially ordered. We define the set of totally ordered symbols to
include any symbol whose horizontal position is forced with respect to any totally
ordered symbol in any voice. (Either type of independent symbol has its horizontal
position forced with respect to any symbol in its own voice.)

The various types of symbols in CMN are categorized as follows: 

(1) Independent, totally ordered: notes, rests (other than whole rests), barlines, key 
signatures, meter signatures. 

(2) Independent, partially ordered: clefs, grace notes, pauses. 

(3) Dependent: performance directions (depend on one independent symbol),
glissandi (depend on two notes or a note and a grace note), groupet accessory
numerals (on two notes or rests), beams (on two notes or two grace notes), slurs
and octave signs (on two notes, which may be several measures apart).

The whole rest is a special case: it is normally centered between two barlines.

See also Table 2. 

In order to do its punctuation job, JUSTIF requires a list that includes all attack
times, that is, all times within the measure where a note or rest starts in any voice.
In SMUT these times are expressed in multiples of a whole note. So far, this is what
[GOMB75] calls the “rhythmic spine”. However, JUSTIF also needs entries in its
attack time list for all other independent, totally ordered symbols. The problem is to
insert these into the list in such a way that (1) the attack times of notes and rests
are preserved well enough to allow proper spacing (which is, of course, a function of
duration), and yet (2) the symbols we have just inserted (those whose duration is
zero) have entries distinct in some way from their neighbors. This could be done in
various ways, but SMUT accomplishes it with the following scheme: Let t be, for
notes and rests, the actual moment when the item begins in whole notes relative to
the beginning of the measure, and for other items, the moment when the preceding
note or rest ended. Let d be 1 for key signatures, 2 for time signatures, 3 for notes
and rests, 4 for barlines, and 0 otherwise. Then the attack time a is given by

$a = \mathrm{ceiling}(1000\,t) + \frac{d}{10}$

(The numbers 1000 and 10 in this formula are not critical; any similar values would
work.) Thus, the attack time of notes and rests is distorted enough to allow inserting
other symbols, but not enough to affect spacing noticeably. This scheme places some
limitations on what sequences of symbols will give correct results, but the limitations
are all outside the syntactic boundaries of CMN: for example, a sequence of symbols
involving two clefs without a note or rest in between will not be handled properly,
but such a sequence can never occur in CMN. Observe in particular that the sequence
“clef, key signature, time signature, note” (as often occurs at the beginning of a
piece) will come out in the correct order. Groupets cause no special difficulties.[21]
For the first measure of Fig. 11, the attack time list is .2, .3, 125.3, 250.3, 375.3,
500.3, 750.3, 1000.4. For the second measure, it is .3, 167.3, 250.3, 334.3, 500.3,
1000.4.
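
The encoding can be checked against the two lists just quoted. The sketch below is illustrative only (the function name and the symbol-kind strings are invented here); it uses exact fractions so that values such as 1/6 of a whole note round up as the formula requires.

```python
import math
from fractions import Fraction

# Values of d from the text; any symbol kind not listed gets d = 0.
D_CODE = {"key_signature": 1, "time_signature": 2, "note": 3, "rest": 3, "barline": 4}


def attack_time(t, kind):
    """Attack-list entry a = ceiling(1000 t) + d/10, with t in whole notes."""
    d = D_CODE.get(kind, 0)
    return math.ceil(1000 * Fraction(t)) + Fraction(d, 10)


# Second measure of Fig. 11 as described by its list: notes at 0, 1/6, 1/4,
# 1/3 and 1/2 of a whole note, then the barline at the end of the measure.
measure = [(Fraction(0), "note"), (Fraction(1, 6), "note"), (Fraction(1, 4), "note"),
           (Fraction(1, 3), "note"), (Fraction(1, 2), "note"), (Fraction(1), "barline")]
entries = sorted(attack_time(t, kind) for t, kind in measure)
print([float(a) for a in entries])
```

The note positions in `measure` are inferred from the quoted list rather than read from the figure, so they should be treated as an assumption; with them, the entries come out as 0.3, 167.3, 250.3, 334.3, 500.3, and 1000.4.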
TABLE 2. Operation Codes and Symbols in Passes III and IV

op code  symbol, subsymbols       JUSTIF type  Representation  Pass

   1     staff/clef               IP
           clef                                S,O
   2     key signature            IP
           accidental                          S
   3     time signature           IP           T
   4     note                     IT
           head                                O
           accidental                          S
           ledger line
           augmentation dot                    O
           note-aligned symbol
           stem
           flag
   5     grace note               IP
           (same components)
   6     rest                     IT           S,O
           rest-related symbol
           augmentation dot
   7     barline                  IT
           repeat dot                          O
   8     beam                     D                            III
   9     performance dir.         D            T
  10     groupet acc. num.        D            T
  11     octave sign              D
           number                              T
           dotted line
  12     user routine call        -
  13     identification           -            T               IV
  14     slur                     D            S               III
  15     miscellaneous            IP
  16     glissando                D            A

Codes for Representation: O: outline, S: skeleton, T: text (see Sec. 4.7 for terminology).
Codes for JUSTIF type: IT: independent, totally ordered; IP: independent, partially
ordered; D: dependent.
Codes for Pass: III: part of note command on Pass III input file, created by Pass II;
IV: written on auxiliary Pass IV input file.

JUSTIF assumes that data describing an entire measure is in an array, MEMB
(for “MEMory for Bar”), as a sequentially allocated list of variable-length records.
JUSTIF consists of an initialization section and four main sections. The initialization
section does only one task of any interest: if the current measure is the first of any system
other than the first, then for each staff it generates a clef and key signature identical
to those in effect on that staff at the end of the previous system. The four main
sections make four passes through MEMB and do the following:

Section I scans MEMB forward and punctuates type-1 symbols. The algorithm starts
with the attack time list. It then:

(1) Assigns “ideal” spacings between consecutive entries in the list according to the
formula

$\mathit{idealspace} = 2.5\,c\,(t_{i+1} - t_i)^{a}$

where $t_i$ and $t_{i+1}$ are consecutive entries in the attack time list, and c and $b = 2^a$ are
set with the HCROWD and HBASE control commands, respectively. The
defaults are 1.0 for c and 1.520 for b. Here at last is the formula I have alluded
to several times. As I said in Sec. 2.3.3.3, what we want is for spacing between
consecutive symbols to increase with the time separating them, but more slowly
than linearly. (The time separating consecutive symbols is, of course, just the
duration of the first.) This formula does the trick as long as 1<b<2, i.e., as long
as 0<a<1, and a value of b about halfway between comes the closest to giving
the spacing of traditionally set music. (Note in particular that if $b = \sqrt{2}$, a = 1/2,
and the formula reduces to

$\mathit{idealspace} = 2.5\,c\,\sqrt{t_{i+1} - t_i}$

I used this formula in SMUT for several years, but it does not differentiate
enough between durations; a slightly larger value of b gives better results.) c, a
“crowding” factor, interacts with the rest of the punctuation process to produce
an interesting effect. Of course, in this formula, it is simply a scaling factor for
the ideal spacing. The interaction results from violations of ideal spacing. In
nontrivial music, as we have seen, ideal spacing is constantly violated, but
usually only to a slight extent. But if c is made very small, ideal spacing will be so
inadequate that it will cease to have any effect on the final setting, and
everything will end up simply squeezed as close together as possible, regardless of
duration.[22]

(2) If this does not leave sufficient space between consecutive symbols within any
voice, increases it just enough so that there is sufficient space. (JUSTIF knows
how much space is needed both backward and forward from the reference
point, the x-coordinate, of each type of symbol.) {Problem: JUSTIF doesn’t
really know: see Sec. 5.2.4.}

(3) If the resulting measure width overflows the current system and if automatic
system breaking was requested, starts a new system. (This is essentially the
same as what Knuth and Plass describe for text [KNUT81, p. 1120] as “the
standard algorithm for line breaking”.)[23] Even if automatic system breaking
was not requested, if the measure width overflows the current system badly,
JUSTIF gives an error message and starts a new system. If starting a new
system results in the old one having zero measures in it, JUSTIF gives an error
message. If it does decide to start a new system, JUSTIF pays special attention
to slurs, octave signs, and glissandi that cross into the new system. Specifically,
it breaks them into two pieces, one for “here” (the end of this system) and one
for “there” (the beginning of the new system).[24] It then quits, returning a
“start new system” flag. Note that, in this case, MEMB contains partly
processed data for the current measure.

Section I also checks the durations of the current measure in all voices for agreement. 
If the “start new system" flag is set on entry to JUSTIF, it skips Section I and starts 
with Section II. 
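
The effect of the spacing base in step (1) is easiest to see numerically. The following sketch assumes the formula has the form idealspace = 2.5 c (dt)^a with b = 2^a, where dt is the time between consecutive attack-list entries; the function name is hypothetical and the units of dt are left unspecified, so only ratios of the results are meaningful.

```python
import math


def ideal_space(dt, c=1.0, b=1.520):
    """Ideal spacing for a time separation dt: 2.5 * c * dt**a, with b = 2**a.

    c stands for the crowding factor (HCROWD) and b for the spacing base
    (HBASE); the defaults are the ones quoted in the text.
    """
    a = math.log2(b)
    return 2.5 * c * dt ** a


# Doubling the duration multiplies the ideal space by b, not by 2.
ratio = ideal_space(2.0) / ideal_space(1.0)
print(round(ratio, 3))   # 1.52

# With b = sqrt(2) the formula reduces to the square-root rule.
print(round(ideal_space(4.0, b=math.sqrt(2)), 6))   # 5.0
```

This shows why values of b between 1 and 2 give spacing that grows with duration but more slowly than linearly: each doubling of duration widens the space by the factor b.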

Section II scans MEMB backward, punctuates type-2 symbols, and does preprocessing of
beams for Section III. Recall that type-2 symbols are right-justified against following
symbols; the backward scan is to facilitate this. The beam preprocessing normally
consists simply of finding what the slope of each beam is, based on the stems of the
end notes. A complicating factor here is that CMN places an ill-defined limitation on
beam slopes (according to [GOMB75], 30 degrees). If the calculated slope exceeds the
maximum legal slope, the maximum legal slope is used instead. SMUT adopts the
usual method of accomplishing the change of slope, which is to adjust the lengths of
stems of notes attached to beams; these could not have been calculated before this
moment, anyway, since (unless the beam happens to be horizontal) stem end
coordinates that will coincide exactly with a beam depend on punctuation.

Section III again scans MEMB backward (with short forward forays) and corrects
end-of-stem coordinates for notes (including grace notes) in beamed groups. It uses the
slope for each beam decided on in Section II to adjust the y-coordinates of ends of
stems to end at the beam, and checks whether each note will then have a long
enough stem. If not, it changes the stem end coordinates and moves the beam up or
down while leaving its slope unchanged.
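
The slope limit and the stem-length check of Sections II and III can be illustrated with a small sketch. It is not SMUT's code and simplifies the situation considerably: it handles only a stems-up group, takes y as increasing upward, and starts from a beam that gives the first note a stem of exactly the minimum length. All names and units are assumptions made for the example.

```python
import math

MAX_SLOPE = math.tan(math.radians(30))   # the 30-degree limit cited from [GOMB75]


def fit_beam(xs, head_ys, slope, min_stem, max_slope=MAX_SLOPE):
    """Return (slope, stem_end_ys) for a stems-up beamed group.

    xs and head_ys are notehead coordinates after punctuation. The slope is
    clamped to the legal maximum; the beam is then raised, with its slope
    unchanged, until every stem is at least min_stem long.
    """
    slope = max(-max_slope, min(max_slope, slope))
    # Tentative beam: the first stem has exactly the minimum length.
    y_first = head_ys[0] + min_stem
    beam_ys = [y_first + slope * (x - xs[0]) for x in xs]
    # Largest amount by which any stem falls short of the minimum length.
    shortfall = max(min_stem - (b - h) for b, h in zip(beam_ys, head_ys))
    if shortfall > 0:
        beam_ys = [b + shortfall for b in beam_ys]
    return slope, beam_ys


# The middle note is two units higher than its neighbours, so a flat beam
# must be raised by two units to give it a long enough stem.
print(fit_beam([0, 1, 2], [0, 2, 0], slope=0, min_stem=3))
```

A stems-down group would be handled symmetrically, lowering the beam instead of raising it.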

Finally, Section IV scans MEMB forward, punctuates type-3 symbols, checks agreement
of clefs, key signatures, and time signatures on shared staves, and outputs everything
for use by Pass IV. {Problems with the current implementation: JUSTIF needs an
image of the page to avoid collisions (see Sec. 5.2.1). JUSTIF should automatically
generate "end-of-staff” material (new clef, key signature, and meter, if they are
changing) as well as "beginning-of-staff” material. Pass III should do casting-off for
the whole movement at a time as Smith’s MSS does, not one system at a time.
There is a close analogy in text formatting here: most text formatters decide line
breaks one line at a time. As Knuth points out [KNUT79, KNUT81], to achieve
optimal results, one must consider a whole paragraph at a time, as his TeX does.}

4.7. PASS IV: OUTPUT

Pass IV does all of the actual drawing of the music, one system at a time. It
positions each system on the page vertically in accordance with the instructions it
receives, starting new pages when necessary; it does "local punctuation" by scaling
x-coordinates to right justify the staff (this is disabled in certain circumstances); and
it scales y-coordinates for the actual height of the staff. It also handles such details
as generating running footers.

The following discussion assumes some knowledge of graphics hardware and sys- 
tems. For background, see [FOLE82] or [NEWM79]. 

As I have said, portability was a very important design goal for SMUT. In Pass
IV, this means independence of the graphic output device as well as of the CPU.
Output-device independence in turn is highly dependent on the availability of
standardized subroutine packages for various devices. In the words of Foley and van Dam
([FOLE82], p. 23):

The main purpose of a device-independent package used in conjunction with a
high-level programming language is to induce application program (and programmer)
portability. This portability is provided in much the same way that a “high-level”
machine-independent language (such as FORTRAN) provides a large measure of
portability: by isolating the programmer from most machine peculiarities and providing
language features readily implemented on a wide spectrum of processors.

Foley and van Dam go on to point out the importance of standardization of such
packages, and to describe the current state of affairs. Briefly, very substantial effort
has now been invested in developing standards for device-independent graphics
packages (see [GSPC79] and [ISO81]). However, as of this writing (early 1983), no
standard has yet been approved, and what is generally available is still fairly primitive.
In any case, when the current work was begun (in 1968) such issues were hardly 
being considered. As a result, I have written Pass IV in such a way that it can be 
run on any device with an absolute minimum of support, but can easily be made to 
take advantage of greater support when available. 

Pass IV works in object space (see Sec. 3.3.6) with floating-point coordinates. The
Pass IV symbol-drawing routines do all of their drawing by calling four subroutines:
RFILL, THLINE, THPLOT, and STRING. The versions I have implemented of
these routines have a basic model of the output device that is purely vector-drawing:
the only primitives they use are PLOT (move cursor from previous to new position,
optionally drawing a vector) and SYMBOL (write a string of text in a specified size
and undefined font). As might be expected, nearly all real-world devices support
these two very simple primitives, although writing text in any size is often not
supported in hardware. Many hardware character generators have size restrictions, so
software to write characters in arbitrary size is very widely available.

Now, Pass IV knows the resolution and linewidth of the output device, since they
are specified by the user (on the layout line). For many devices, these two
parameters have the same value; for pen-and-ink plotters, the linewidth is usually
greater than the distance between addressable positions. In any case, with this
information, RFILL and THPLOT fill in regions (noteheads, augmentation dots, parts
of clefs and rests) by simulating raster-scan hardware, generating their own calls
to PLOT to draw lines at intervals of the linewidth. RFILL takes as input a list of
vertices bounding the region in question, i.e., an “outline” representation. It can
fill in all regions whose boundaries are convex polygons (actually, it can handle
some regions with concavities, but SMUT does not rely on this capability). THPLOT
draws “thick vectors”, for example, the nonvertical strokes in the sharp and the
natural. These thickened lines are in fact convex polygons and could therefore be
drawn by RFILL; I use THPLOT merely for its conceptual similarity to PLOT. THLINE
thickens lines in symbols (slurs, bodies of clefs, the rounded part of the flat,
etc.) by drawing the center line and edges (and, sometimes, middle lines; see below)
from a “skeleton” representation, i.e., specification of a center line and (varying)
linewidth. Finally, STRING thickens text simply by writing it repeatedly with small
horizontal offsets, namely, successive multiples of the linewidth. Note that the
output device’s linewidth, not its resolution, is used in all of these cases.
Resolution is used directly in only one way, to decide whether to thicken slurs at
all or to draw a single “zero-thickness” line. SMUT draws every slur as a circular
arc; when it thickens one, it simply uses the same endpoints with slightly different
radii of curvature to generate additional arcs, then calls THLINE. (Problem:
circular arcs are really not the correct shape. Slurs in engraved music, especially
long ones, are more curved at the endpoints than in the middle. Also, slurs can have
inflection points, as noted in Sec. 2.3.3.U.)
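
The raster-scan simulation described for RFILL can be illustrated with a short
sketch. This is only an illustration in Python, not SMUT's actual code: the name
fill_convex_polygon, the choice to start half a linewidth inside the region, and
the representation of each stroke as a pair of endpoints (standing in for one
"move" and one "draw" call to PLOT) are all assumptions.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Stroke = Tuple[Point, Point]


def fill_convex_polygon(vertices: List[Point], linewidth: float) -> List[Stroke]:
    """Cover a convex polygon with horizontal strokes one linewidth apart.

    Hypothetical stand-in for RFILL: each returned stroke corresponds to a
    pen-up move to its first point and a pen-down draw to its second.
    """
    if linewidth <= 0:
        raise ValueError("linewidth must be positive")
    ys = [y for _, y in vertices]
    y_min, y_max = min(ys), max(ys)
    n = len(vertices)
    strokes: List[Stroke] = []
    # Assumption: start half a linewidth above the lowest vertex.
    y = y_min + linewidth / 2
    while y < y_max:
        crossings = []
        for i in range(n):
            (x0, y0), (x1, y1) = vertices[i], vertices[(i + 1) % n]
            if y0 == y1:
                continue  # a horizontal edge never crosses a scan line
            # Half-open test so a vertex shared by two edges is counted once.
            if min(y0, y1) <= y < max(y0, y1):
                t = (y - y0) / (y1 - y0)
                crossings.append(x0 + t * (x1 - x0))
        if len(crossings) >= 2:
            # For a convex outline the scan line enters and leaves exactly once.
            strokes.append(((min(crossings), y), (max(crossings), y)))
        y += linewidth
    return strokes
```

Because the spacing is the linewidth rather than the resolution, a device with a
thick pen needs fewer strokes than a fine raster to cover the same notehead.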

The way in which many symbols are drawn is affected by a "publication quality" 
switch. If it is set, RFILL takes extra care in filling regions (by drawing the outline 
as well as making raster scans across the interior) and THLINE takes extra care in 
thickening everything, especially quarter rests, slurs, and clefs (by drawing middle 
lines, among other things). This is in an effort to produce the best possible plot qual- 
ity; however, it is likely to help only on high-resolution vector-drawing devices, espe- 
cially pen-and-ink plotters, and of course it slows execution down. 

This minimal or "least common denominator" output device method is extremely
device independent and can produce very good results. For examples of SMUT output
produced in this way, compare Figs. 2 and 12: these are, respectively, the score of
Bach’s so-called "Sloth Canon" printed on a Zeta 3053SX pen plotter, and an
instrumental part for the same piece, automatically transposed with a clef change
and printed on a Versatec 1200 electrostatic plotter. The Versatec has much lower
resolution than the Zeta, particularly visible as jagged edges on slurs and beams.
(SMUT took the low resolution into account and did not thicken the slurs at all,
since they would not have looked good.) See also Figs. 13 (printed on the Versatec),
14 (Zeta), 15 (Zeta), 16 (Versatec; a demonstration of some of SMUT's special
symbols), and Chapter 3, Fig. 4 (Zeta).

Figure 13. Bartok: String Quartet No. 4, I, p. 1 (set by SMUT).

Figure 14. Van Hulse-Kriewall: Partita Sapru Alleluia Octavi Modi (set by SMUT).

Figure 16. SMUT symbol demonstration.

However, this device-independent technique has a major disadvantage, namely that it
can be very inefficient on more powerful hardware. Such inefficiency can easily be
disastrous in interactive graphics, since it is likely to increase response time
significantly; but it is rarely serious in a batch program such as SMUT. The
particulars described above do involve one compromise with visual quality:
thickening of text should really be done not merely with horizontal offsets but
also with vertical ones, preferably in a circular pattern. The image of a point
under thickening should be a two-dimensional region, not a line. However, the
inefficiency of drawing characters O(n²) times is too great, even for batch use;
O(n) repetitions provide generally good quality with much better speed.
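
The difference between the two thickening schemes can be made concrete with a
small sketch. It is written in Python purely for illustration; the function names,
the parameter "passes" (the desired thickness expressed in linewidths), and the
particular way the circular pattern is sampled on a square grid are assumptions,
not a description of STRING itself.

```python
import math
from typing import List, Tuple

Offset = Tuple[float, float]


def horizontal_offsets(passes: int, linewidth: float) -> List[Offset]:
    """O(n) scheme: successive multiples of the linewidth along x only."""
    return [(i * linewidth, 0.0) for i in range(passes)]


def circular_offsets(passes: int, linewidth: float) -> List[Offset]:
    """O(n^2) scheme: every grid offset that falls inside a disc.

    The disc's diameter is passes * linewidth, so a point is smeared into a
    roughly circular two-dimensional region instead of a horizontal line.
    """
    centre = (passes - 1) * linewidth / 2
    radius = passes * linewidth / 2
    offsets: List[Offset] = []
    for i in range(passes):
        for j in range(passes):
            dx = i * linewidth - centre
            dy = j * linewidth - centre
            if math.hypot(dx, dy) <= radius:
                offsets.append((dx, dy))
    return offsets
```

Each offset costs one complete redrawing of the text, so the length of the
returned list is the number of times the string would have to be written.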

As I have suggested, better support than the minimal PLOT and SYMBOL (e.g.,
hardware area fill or boldface characters, or fast assembly-language routines for
area filling, perhaps by generating parallel vectors) is becoming more and more
widely available. When it is available, any or all of RFILL, THPLOT, and STRING can
easily be replaced with trivial versions that simply invoke the appropriate
operations, so nothing has been lost by the minimal approach, while nearly complete
device independence has been gained. THLINE is a little less easy: to take
advantage of typical hardware features, it will have to convert from the skeleton
representation it receives to a list of vertices in the manner of RFILL. This is
still not at all difficult.
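
One way the skeleton-to-outline conversion mentioned for THLINE might be done is
sketched below in Python. It is an assumption-laden illustration rather than
SMUT's method: the function name, the use of neighbouring skeleton points to
estimate the direction of the center line, and the ordering of the output
vertices are all hypothetical.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def skeleton_to_outline(center: List[Point], widths: List[float]) -> List[Point]:
    """Turn a center line with per-point linewidths into a closed vertex list.

    Each skeleton point is pushed half its width to either side, along the
    normal to the local direction of the center line. The result runs down
    one edge and back along the other, like the outlines RFILL takes.
    """
    if len(center) != len(widths) or len(center) < 2:
        raise ValueError("need at least two points, each with a width")
    last = len(center) - 1
    left: List[Point] = []
    right: List[Point] = []
    for i, ((x, y), width) in enumerate(zip(center, widths)):
        # Estimate the local direction from the neighbouring points.
        x0, y0 = center[max(i - 1, 0)]
        x1, y1 = center[min(i + 1, last)]
        dx, dy = x1 - x0, y1 - y0
        length = math.hypot(dx, dy)
        if length == 0:
            raise ValueError("coincident skeleton points")
        nx, ny = -dy / length, dx / length  # unit normal to the center line
        half = width / 2
        left.append((x + nx * half, y + ny * half))
        right.append((x - nx * half, y - ny * half))
    return left + right[::-1]
```

For a curved skeleton such as a slur the resulting polygon is generally not
convex, so a fill routine limited to convex regions would have to be given the
outline one quadrilateral segment at a time.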

4.8. A SAMPLE RUN 

In order to make the principles of SMUT’s operation clearer, I will now go
through a sample run that produces two measures of a three-staff score, specifically
the first two measures of the score in Fig. 2. The data for this run appears in
MUSTRAN form in Fig. 17. The result of running it through MUSTRAN and SMIRK
(see Sec. 3.5) is the SMUT data in Fig. 18. Appendix I gives a complete description
of SMUT from the user’s viewpoint. However, the format of SMUT commands is
quite simple, and I will summarize it here.

A SMUT input file always starts by giving certain global information: it begins
with one layout line (having “L” in column 1) and several initialization lines
(having various operation codes in column 1; see Table 3). (Also see the description
of SMUT’s Initialization Phase in Sec. 4.3.) The initialization lines, in this case,
run through line 9 and describe only page header and footer information; they are
terminated by the line consisting only of “X”.



LI. OS. E3tX. 4“4C. '*CEACH. SLOTH CAHtM FBOI 8171079 
■ : 12CADAQI0) ' , ••IHVEBSOS* . ZB. 8B. 180. IBP. 4E. / 
3EJ, 80, 20. , /,T(. 

IBB. 1BD. 1BE. 18F. 40. 60 J. 8HA. 4HB. /. 

8KBJ. 60*. 40*. 18D*J, ISO*. IBP*. 1BJE*. 

40*. /. 8D*J. 8HB. 2C*. 4B. /. 

4R. 4B. SB. SHE, 4F. /. 
aPJ. 80. 4P. 6FJ. 8*8. 4*A, /. 

8JU. 80. U, 8AJ, 80*. 48. /. 

L2. CS. n*K. 4=40. ’:12CADAaiO)’, **IROTAL THEBE’ . 
40. 1BCJ: BB-: C: D. 4E.. 8P. /. 

40. 160J. 1BE. IBP. lflO. 4A. , IBB. 1BC. /.T(. 

4HB-. 1BB-J. 1B0-. 168- . 160 . 40.. 16R. IBM. /. 
4«P.Z2. IBB. lflKE, 4BP., IBB. 160. /. 

4HE. . IBB. 160, 2*E. /. 

6EJ, 80. 60.. 16*0. 80.22. 16C. 60.22. 16HB-. /. 
4HB-.Z2. IBM-. 160-. 40.. 16B. IBP. /. 

4E. 18EJ. 180. ISO. 180. 4D.Z2, 80. /. 

L3. FEZ, E3*I. 4 “40. ’:12(ADAOIO)\ ••IHECTDS’, 

BB. 16R, 320. 320. BE.. 1CP. 40.. 32R. 32F, S2E. 320 

80.. 183- . BA-.. 180- . 8F-. 32T-J. 320-. 

32D-. 32HE-. 8P-.. 18A-, /.T(, 

40-. 4B. 8H. IBB. 18E. 80.. 1BC. /. 

60. . IBM-, 8HB-. . 1BC. 8HB-.. 160-. 8M-.. 

16B-. t. 

80.. .180.. 80.. 18*8-, 8*A-. , 16E-. 8A-.. 1B0-. /. 
4?-.. SZB, 320- . 32P-. 3ZHE-. 8F-.. 

3 20-. 32A-. 6KD-. , 32IE-. 32P-. /. 

40-.. 840- J, 84HA-. 84HB-. 84C. 640. 84E. 84P. 

840. 8*A, 32AJ. 320. 32P. 32HE. 8F.. 16HB-, /. 

80. 320 J, 320-. 320- . 32E-. 4P-. 1BF-J. 

180- , 18E-, 10F-, 160- , 18A-. ISO-. 13P-. /. 

EHD 


Figure 17. MUSTRAN data for the Sloth Canon 



 1. IS 4
 2. Ha C'SLOTH CANON'
 3. hs
 4. H4 C'FROM THE MUSICAL OFFERING, BWV 1079'
 5. HO R'J.S. BACH'
 6. H4 R'(1685-1750)'
 7. F5 C'SMUT 2.8 SAMPLE PLOT FROM MUSTRAN DATA'
 8. F4 C'DONALD BYRD, ICC, INDIANA UNIV., BLOOMINGTON, IN 47405'
 9. R4 L'SLOTH CANON'
10. X
11. * NOJUST
12. * 35HCROWD
13. * NOBAR
14. * NODESYNC
15. * NOCHROM
16. S 1 1 -3 4 4 -2 0
17. *128TLFT128
18. PI 4 02 a(ADAGIO)
19. 10 -00 -0 INVERSUS
20. R 0 2 0
21. R 0 8 0
22. N 33 0 16 0
23. N 32 0 16 0
24. N 31 0 4 0TI
25. B 1
26. N 31 0 8 0
27. N 20 0 8 0
28. N 28 0 2 1
29. B 1
30. X
31. * 35HCROWD
32. * NOBAR
33. * NODESYNC
34. * NOCHROM
35. S 1 1 -3 4 4 -2 0
36. *128TLFT128
37. PI 4 02 0(ADAGIO)
38. 10 -00 -0 ROYAL THEME
39. N 28 0 4 0TI
40. N 29 0 16 0
41. N 23 30 16 0
42. N 29 0 16 0
43. N 30 0 16 0
44. N 31 0 4 1
45. N 32 0 8 0
46. B 1
47. N 33 0 4 0TI
48. N 33 0 16 0
49. N 31 0 16 0
50. N 32 0 16 0
51. N 33 0 16 0
52. N 34 0 4 1
53. R 0 16 0
54. N 28 0 16 0
55. B 1
56. X
57. * 35HCROWD
58. * NOBAR
59. * NODESYNC
60. * NOCHROM
61. S 1 7 -3 4 4 -2 0
62. *128TLFT128
63. PI 4 02 8(ADAGIO)
64. 10 -00 -0 RECTUS
65. R 0 8 0
66. R 0 16 0
67. N 29 0 32 0
68. N 30 0 32 0
69. N 31 0 8 1
70. N 32 0 16 0
71. N 33 0 4 1
72. R 0 32 0
73. N 32 0 32 0
74. N 31 0 32 0
75. N 30 0 32 0
76. B 1
77. N 29 0 8 1
78. N 28 0 16 0
79. N 27 0 8 1
80. N 29 0 16 0
81. N 25 0 8 0TI
82. N 25 0 32 0
83. N 22 0 32 0
84. N 23 0 32 0
85. N 24 30 32 0
86. N 25 0 8 1
87. N 27 0 16 0
88. B 1

Figure 18. SMUT input data for the Sloth Canon
TABLE 3. Initialization Operation Codes

op code   meaning

B         Bars-per-system
C         Comment
F         Footer
G         Group voices
H         Header
R         Running footer


The remainder of the input describes each voice in turn. It uses a set of
operations completely disjoint from those used in the initialization phase. Column 1
of each line again contains an operation code, but now from the set given in Table 4.
The remainder of each line contains parameters for the operation, in a fixed-field
format. In this case, the lines in the next group, those beginning with “*”, are
SMUT control commands. They turn off right justification (11); ask that horizontal
spacing be reduced to 35 percent of its usual value (12); disable automatic
insertion of barlines and automatic rhythm clarification (13, 14); and specify which
of the two possible notations for pitch will be used below (15).

Now the description of the music proper begins. Line 16 asks for a new staff with
treble clef, key signature of three flats, and logical time signature of 4/4, but
written time signature of "C". (The distinction between "logical" and "real" time
signatures is meaningful because, as we saw in Secs. 4.4.1 and 4.4.2, the time
signature (in fact, the logical time signature) affects SMUT's automatic rhythm
notation procedures.) A command like that on line 17 is always generated by SMIRK in
case there is an anacrusis (Sec. 2.3.3.2.3). This one says that the time remaining
in the current measure (whose duration is four quarter notes, or 128 128ths) is 128
128th notes: that is, it says that there is no anacrusis. Lines 18 and 19 describe
annotations: respectively, a tempo marking and “staff identification.” Line 20 at
last describes the first time-occupying symbol (a half rest), line 21 the following
eighth-rest, and line 22 the first note, the sixteenth-note G4. A tie occurs on line
24 ("TI"), and the data for voice 1 are terminated on line 30 by the "X". The only
interesting new feature in the other voices is the accidental on line 41 (a natural,
indicated by the "30").

TABLE 4. Pass I Operation Codes

op code   meaning

A         begin Artificial group (groupet)
B         Barline
E         Edit
G         Grace note
I         Identification/indent
M         Miscellaneous symbol
N         Note
P         Performance direction
R         Rest
S         Staff-clef-key signature-meter
T         ocTave sign
U         call User routine
*         control

This is, then, a straightforward description of the music that says nothing explicit 
about symbol positions, not even stem directions. It does not say whether the ties 
should be above or below the notes, let alone give exact positions for them. Beams 
are specified to an even lesser degree — in fact, there is absolutely no information 
about them! The data (other than global information at the beginning) are organized 
linearly by voices: all of voice 1, then all of voice 2, then all of voice 3. 
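
The overall shape of such a file (global lines, then one block per voice, each
block closed by a line consisting only of "X") can be sketched as follows. This is
a hedged illustration in Python: the function name is hypothetical, and the
sketch deliberately does not interpret the fixed-field parameters, whose column
positions are given only in Appendix I.

```python
from typing import List, Tuple


def split_smut_input(lines: List[str]) -> Tuple[List[str], List[List[str]]]:
    """Split SMUT-style input into the global section and per-voice sections.

    Every section is assumed to end with a line consisting only of "X";
    the first section holds the layout and initialization lines, and each
    later section holds the records of one voice.
    """
    sections: List[List[str]] = []
    current: List[str] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            continue  # ignore blank lines
        if line.strip() == "X":
            sections.append(current)
            current = []
        else:
            current.append(line)
    if current:
        raise ValueError("input does not end with an 'X' terminator line")
    if not sections:
        raise ValueError("no sections found")
    return sections[0], sections[1:]
```

Applied to data shaped like Fig. 18, the first list would hold the layout, header
and footer lines, and the second would hold three voice blocks in file order.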

The Pass III input file illustrated, slightly truncated, in Fig. 19 shows the
effects of two major transformations, namely those of Passes I and II. As I said
above, Pass II simply reorders the records, sorting them first by measure and then
by instrument. Within each instrument, they remain in their original order, but
with some significant changes, as we shall see. (Henceforth, we shall adopt the
convention that an expression like "line III-24" refers to line 24 of Pass III
input. Pass I input appears in Fig. 18, Pass III input in part in Fig. 19, and Pass
IV input in part in Fig. 20.) Not surprisingly, the most interesting case is that of
the note. Line I-22 describes the first note in the top voice; it is replaced by
line III-8, then IV-7 (op code 4 is a note in both Passes III and IV; see Table 2).
Here is a summary of the information in the three forms. The coordinate systems were
described in Sec. 4.3.

I-22    pitch=33 0 (G4, no accidental), duration=16 0 (16th-note with 0 dots).

III-8   time=625.3, y=2, head type=1, y-stem=9, 0 flags, 0 dots, accidental type=0,
        beam activity=200 (begin 2 beams), begin 0 slurs and ties, slur direction=0,
        slur modifiers=0.

IV-7    x=8,670, y=250, head type=1, y-stem=1120, 0 flags, 0 dots, accidental type
        and position=0.

Pass IV is a straightforward interpreter that calls external subroutines to draw all
of the music; its principles were described in Sec. 4.7.

      bar  voice  time     op  parameters

  1.   1    1     -99.0     1  1 1 0
  2.   1    1     .1        2  -3 0 1
  3.   1    1     .2        3  4 4 -2
  4.   1    1     -99.0     9  5 0 2 oa*« I***
  5.   1    1     -99.0    13  3 0 0 8** ••••< • •*•4 l«***4 !•**
  6.   1    1     .3        0  4 2 0 0*+***
  7.   1    1     600.3     0  4 4 0 o«« ««•
  8.   1    1     625.3     4  2 1 9 0 0 0 200 0 0 0
  9.   1    1     oao.s     4  1 1 6 0 0 0 2000 0 0 0
 10.   1    1     760.3     4  0 4 7 0 0 0 0 1 1 0
 11.   1    1     1000.4    7  1
 12.   1    2     -99.0     1  1 1 0
 13.   1    2     .1        2  -3 0 1
 14.   1    2     .2        3  4 4 -2
 15.   1    2     -99.0    13  3 0 0 >•«««< >*•*
 16.   1    2     .3        4  -2 1 -5 0 0 0 0 1 4 0
 17.   1    2     250.3     4  -2 1 5 0 0 0 200 1 10 20
 18.   1    2     313.3     4  -3 1 5 0 0 30 0 0 0 0
 19.   1    2     376.3     4  -2 1 5 0 0 0 0 0 0 0
 20.   1    2     438.3     4  -1 1 0 0 0 0 2000 0 0 0
 21.   1    2     600.3     4  0 1 7 0 1 0 0 0 0 0
 22.   1    2     675.3     4  1 1 0 1 0 0 0 0 0 0
 23.   1    2     1000.4    7  1
 24.   1    3     *88.0     1  1 7 0
 25.   1    3     .1        2  -3 0 7
 26.   1    3     .2        3  4 4 -2
 27.   1    3     -99.0    13  3 0 0 0** 49*44 >*«•*<
 28.   1    3     .3        0  • 4 0 0«««4*
 29.   1    3     125.3     0  4 8 0 o«« ***
 30.   1    3     ISO.3     4  10 1 2 0 0 0 300 0 0 0
 31.   1    3     219.3     4  11 1 3 0 0 0 3000 0 0 0
 32.   1    3     250.3     4  12 1 4 0 1 0 100 0 0 0
 33.   1    3     430.3     4  13 1 4 0 0 0 1011 0 0 0
 34.   1    3     600.3     4  14 1 4 0 1 0 0 0 0 0
 35.   1    3     875.3     0  4 0 0 o** *«*
 36.   1    3     907.3     4  13 1 4 0 0 0 300 0 0 0
 37.   1    3     930.3     4  12 1 3 0 0 0 0 0 0 0
 38.   1    3     959.3     4  11 1 3 0 0 0 3000 0 0 0
 39.   1    3     1000.4    7  1
 40.   2    1     .3        4  0 1 7 0 0 0 100 1 10 20
 41.   2    1     126.3     4  -1 1 6 0 0 0 1000 0 0 0
 42.   2    1     250.3     4  -2 2 5 0 1 0 0 0 0 0
 43.   2    1     1000.4    7  1
 44.   2    2     .3        4  2 1 9 0 0 0 0 1 1 0
 45.   2    2     250.3     4  2 1 9 0 0 0 200 1 10 20
 46.   2    2

Figure 19. SMUT Pass III input for the Sloth Canon
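
The reordering performed by Pass II, from voice-by-voice order to measure-by-measure
order, amounts to a stable sort. The Python sketch below illustrates only that one
idea; the Record type and its field names are hypothetical, the voice number stands
in for the instrument, and the other changes Pass I and Pass II make to the records
are not modelled.

```python
from typing import List, NamedTuple


class Record(NamedTuple):
    bar: int      # measure number
    voice: int    # voice (instrument) number
    op: str       # operation code, carried along untouched


def reorder_by_measure(records: List[Record]) -> List[Record]:
    """Sort first by measure, then by voice, keeping ties in input order.

    Python's sorted() is stable, so records that share a measure and a
    voice stay in the order in which they were originally given.
    """
    return sorted(records, key=lambda r: (r.bar, r.voice))
```

For example, input listing all of voice 1 and then all of voice 2 comes out with
measure 1 of both voices first, followed by measure 2 of both voices.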


Figure 20. SMUT Pass IV input for the Sloth Canon


NOTES


[1] A slight area of fuzziness is stem direction for notes on the middle of the
staff. General opinion here is that this was formerly a matter of choice but that
down-stems are now de rigueur.

[2] As mentioned in Sec. 2.3.3.1.1, ties are logically somewhat different from
slurs, but, graphically, the possible shapes and positions of ties are a subset of
the possible shapes and positions of slurs. In fact, after Pass I, SMUT does not
distinguish between the two. Unless otherwise specified, the term "slur" henceforth
should be taken to mean “slur or tie” (however, "slur or tie" should not be taken to
mean "slur or tie or tie").

[3] Actually, the argument that this particular group had to be beamed down is
rather weak, but the same tie placement situation would have arisen if the second
note in the beamed group were much higher, in which case the beaming would
unquestionably have had to be down.

[4] This rule is particularly reasonable for two-note slurs (and all ties), since it
avoids any intersection between the slur/tie and the stem of the second note.

[5] The problems of aligning multiple voices in parts are far less serious, when
they occur at all, than in scores. On the other hand, placing page turns (see Sec.
2.Z.ZA) is very hard in both, but follows different principles. The difficulty of
placing page turns is, for the time being, a moot point: no computer formatting
system I know of, including SMUT, even attempts to place them automatically.

[6] Prototype programs for both purposes have been written at Indiana University:
the editor by Rosalee Nerheim, the program to combine music and text by me.

[7] However, SMUT does not support all of the levels I mention in Sec. 2.3.O.2.3.

[8] The reverse capability (fusing several consecutive rests or tied notes into one)
would be both useful and relatively easy to implement, but SMUT does not provide it.

[9] This is especially surprising in two books that concentrate on 20th-century
notation, [STON80] and [READ78]. Stone clearly recognizes the great importance of
rhythm notation in 20th-century music; Read, indeed, is writing exclusively on
rhythm notation. Nonetheless, neither includes anything like a clear statement of
principles or a comprehensive set of examples on subdividing notes to clarify the
rhythm. (For more on Read, see note 14 below.)

[10] For notation purposes, there is no need to establish a hierarchy of measures,
since in CMN every measure must begin with a note or rest and, therefore, no
decision need ever be made about whether to divide notes or rests at the barline.
(Sec. 2.5 and particularly note 61 of Chapter 2 cite instances of dotted notes with
the notehead before the barline and the dot after, a practice that was not uncommon
in the 16th and 17th centuries but is now very rare. Also, measures with no note or
rest on the downbeat can result from groupets that cross the barline (as in Chapter
2, Fig. 35, oboe), but such groupets are arguably not CMN.) However, a hierarchy of
measures can be important for other purposes; see note 11 below.

[11] Allen Winold has pointed out that this scheme can easily be extended to rank
strengths of downbeats of measures, something of value to music theorists for
nonnotational purposes. One can simply use consecutive positive integers starting
with, say, 5 for successively stronger downbeats.

[12] This phenomenon is not at all rare; in fact, it may be recalled that we used
the need to notate such times as the basic justification for the existence of
groupets (Sec. 2.3.3.1).

[13] Read, Ross, and Stone all give some sort of usage rules for rests; however,
none of them compares them to the rules for notes or makes it at all easy for a
reader to do so. Furthermore, all make the identical mistake of giving their rules
in too concrete a form, referring to specific rest durations, when what they clearly
mean is rest durations relative to the beat and, therefore, to the time signature.
See [READ69], p. 100; [ROSS70], p. 179; [STON80], p. 133.

[14] Read also gives two versions of each example, similar to my “before” and
“after” versions, which he calls “traditional format” and “modern format”, and says
that the “modern” versions have “equal clarity and [...]”. I disagree, and so do
musicians I have asked.

[15] SMUT assumes that meters with five beats are divided 2+3, although 3+2 is also
quite possible.

[16] In essence, this is the rule given in [DONA63].

[17] [GOMB75] gives a more elaborate and probably better algorithm (pp. 72-74).

[18] The similarity between punctuation in music setting and line-breaking and
justification problems in setting “polyglot Bibles” is especially striking. Knuth
and Plass remark [KNUT81, p. 1166]:

One of the most difficult challenges faced by printers over the years has been the
typesetting of “polyglot Bibles” -- editions of the Bible in which the original
languages are set side by side with various translations -- since special care is
needed to keep the versions of various languages synchronized with each other.

Knuth and Plass discuss several such Bibles printed between 1517 and 1857. Needless
to say, these were all set by hand, and Knuth and Plass say almost nothing about
what automated setting of such texts would require.

[19] The terms filling and adjusting, respectively, are sometimes used for these
functions, e.g. in the well-known UNIX text formatters NROFF and TROFF and their
documentation [OSSA78, KERN78b].

[20] Instances are occasionally found in CMN in which a clef shares horizontal space
with a note (see Sec. 2.C for examples). It is debatable whether this is really
correct: it is certainly not standard.

[21] With respect to adding groupets to the spine, Gomberg incorrectly states
[GOMB75, p. 46]: “One measure of the difficulty involved is that no existing
automated process appears to even try to print groupets.” Actually, early versions
of SMUT were printing groupets well before 1975 (see [BYRD74]), although not the
nested groupets that Gomberg tries to handle.

[22] [GOMB75] describes in detail a somewhat different and considerably more complex
spacing method that is apparently the standard method used by music engravers. The
computer notation systems of Armando Dal Molin [DALM78] and Leland Smith [SMIT73]
use a “pseudo-Fibonacci series” approach: when the duration is doubled, the ideal
spacing goes to the next Fibonacci number (times a constant). For reasons I do not
fully understand, all three methods actually produce very similar results, and I
doubt if anyone but an engraver would care about the differences. My method has the
advantage over the others of having two parameters that might usefully be varied,
for example to set music with very tight spacing.

[23] There are two differences. First, SMUT does not “hyphenate”: the equivalent in
CMN, breaking measures across systems, is indeed possible, but is far more
undesirable and rarely done. A brief discussion of such phenomena appears in Scott
Kim's book [KIM81, p. 93]. Second, except under very special circumstances, SMUT
never compresses: as far as it is concerned, the ideal spacing is the minimum,
which is not really a good assumption.





[24] The equivalent in formatting most natural languages, breaking lines, can at
worst involve breaking a word and adding a hyphen to the first part. This requires a
dictionary or rules to decide where to hyphenate, but is otherwise much simpler. In
German, breaking a word across lines may also result in changes to its spelling.



5 

Conclusions 


5.1. INTRODUCTION

In this concluding chapter, I will try to shed some light on the future of computer
music setting by criticizing both my work and that of others. What approaches
don’t work well and what approaches do? Why? I will discuss what seem to me the
most important problem areas in detail, but will touch on other areas only briefly.
The chapter will also be fairly speculative.

5.2. SYMBOL PLACEMENT 

Ideally, symbol positioning in CMN should be done so that the notation not only
conveys the correct meaning, but does so with as little effort on the reader’s part
as possible. But it should be obvious by now that following the explicit rules of
CMN, as given in the standard texts, is not even sufficient to guarantee
correctness, much less maximum ease of reading. This is strongly suggested by Sec.
2.5, “Counterexamples and ‘Fully Automatic High Quality Music Notation’.” It was
also the burden of my statement in the Introduction (Sec. 1.3) that one of the goals
of my work was

to gather explicit knowledge about one of the most sophisticated notational systems
in existence, namely CMN. There are of course many books on music notation, but
all, or nearly all, were written by musicians for musicians, and the natural result
is that they make huge assumptions . . . The closest thing I know of to an explicit
statement . . . is still no more than, say, 5% explicit.

A recent paper surveying document-formatting systems of many types includes a
section entitled “Relations among Concrete Objects” [FURU82, pp. 458-59] that
briefly discusses some of the formatting issues that arise in symbol placement for
CMN. It brings up and criticizes some of the ideas for “constraints” found in such
systems as TEX (glue between adjacent objects), IDEAL (equations involving points
on arbitrary objects), and ThingLab (“general” constraints). According to [FURU82],
the technique used in ThingLab “includes equation solving as a special case, but is
much more general [than IDEAL's technique] and may be used with constraints that
are not numeric." Two references for constraint systems that [FURU82] does not
mention are [KNUT79], which discusses Knuth’s well-known equation-based system
METAFONT, and [GIPS75], which discusses still another type of constraint system,
the shape grammar with coordinates. Some of these are promising for positioning
symbols in CMN, especially in conjunction with other approaches, but it is very
doubtful that any of them "as is" can solve the problem.

5.2.1. An Unsolved Problem: Avoiding Perceptual Collisions

One important requirement of music setting that is still far from explicit is
that of avoiding collisions. A collision might be defined as a situation where two
symbols overlap or touch. This requirement is especially interesting because it
has close relatives in other areas of computer graphics, some of which we will
only touch on and some of which we will consider in detail: in realistic
three-dimensional graphics (the "hidden-surface" problem), in cartography (the
problem of labelling map features), in design (the problem of positioning physical
objects, e.g., components and paths in an electronic circuit), and in an area that
is not ordinarily considered part of graphics at all, namely, typesetting text (the
“kerning” problem). The latter case is different from, and much simpler than, all
the others in that it is essentially one-dimensional, while they are
two-dimensional; nonetheless, it is worth thinking about.

As we saw in Secs. 2.4 and 2.5, many types of collision are perfectly acceptable
in CMN, especially where both symbols are (in some extended sense) lines. In the
latter case the English language has a more specific term than "collision", namely
intersection. For example, intersections of slurs and ties with stems, accidentals,
barlines, etc., are quite common, especially in crowded areas (see Sec. 2.5). These
collisions are acceptable because it is easy to see the separate symbols involved.
On the other hand, some situations where symbols do not actually touch but
merely come close to each other are objectionable. So the term "collision" is
simultaneously too inclusive (since some intersections are acceptable CMN) and
not inclusive enough (since some near-misses are not acceptable). We need a
better term. Let us, then, rename a situation where two symbols overlap or touch
a literal collision, and call a situation where two symbols overlap, touch, or come
close to touching in such a way that they cannot easily be separated by a trained
eye a perceptual collision. (Thus neither type of collision is a subset of the
other.) Intersections are always literal collisions, but are not necessarily
perceptual collisions. In CMN, of course, we are concerned exclusively with avoiding
perceptual collisions. Some clearcut examples of literal collisions that are very
likely to be perceptual collisions would be: slurs cutting through noteheads or
groupet auxiliary numerals, and performance directions being superimposed on
noteheads or beams. (In fact, these are all very rare in published music.)
Perceptual collisions that are not literal collisions are harder to describe
verbally; Fig. 1 shows three, arranged in order of increasing complexity. Fig. 1a is
confusing at first glance because the accidental, which can only belong to the
following note, is positioned closer to the preceding note. (A similar case is in
Chapter 4, Fig. 13, m. 7, violins, where a staccato dot is too far from its note and
too close to a nearby dynamic marking.) In Fig. 1b, an augmentation dot on one note
appears to be a staccato dot on another. (A similar case in published music is in
Bach’s Nun komm’, der Heiden Heiland, Peters edition, m. 8.) Finally, Fig. 1c is an
extreme case, suggested by Hofstadter [HOFS83a], in which "correct" placement of
staccato dots results in one being positioned inside an adjacent notehead! How can
it be ensured that such things do not occur?

Actually, we can distinguish two classes of domain, corresponding to the two
types of collisions. In the first, the “pictures” are intended for human viewing:
this involves perceptual collisions. In the second, they are intended for use in a
machine: this requires a slight generalization of the notion of “literal
collisions”. For example, in electronic design, layouts of printed-circuit boards
and integrated circuits fall into the second category. They are intended to be used
in making machines, and so they must satisfy absolutely rigid criteria such as
minimum line widths and spacings: see, e.g., [MEAD80]. But guaranteeing minimum
spacing between symbols can be reduced to avoiding literal collisions simply by
surrounding each symbol with an “envelope” and then looking for literal collisions
between the envelopes. I will call violations of this type of spacing constraint
literal near-collisions. Literal collisions are then a special case of literal
near-collisions in which the envelopes are identical to their objects.
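
The reduction from minimum spacing to literal collisions can be shown in a few
lines. The Python sketch below makes a strong simplifying assumption that is not in
the text: every symbol is approximated by an axis-aligned bounding rectangle, and
its envelope is that rectangle enlarged by a fixed margin on all sides. The names
Box, inflate, literal_collision, and literal_near_collision are hypothetical.

```python
from typing import NamedTuple


class Box(NamedTuple):
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def inflate(box: Box, margin: float) -> Box:
    """Surround a symbol's box with an envelope 'margin' wide on every side."""
    return Box(box.x_min - margin, box.y_min - margin,
               box.x_max + margin, box.y_max + margin)


def literal_collision(a: Box, b: Box) -> bool:
    """True if the two boxes overlap or touch."""
    return (a.x_min <= b.x_max and b.x_min <= a.x_max and
            a.y_min <= b.y_max and b.y_min <= a.y_max)


def literal_near_collision(a: Box, b: Box, margin: float) -> bool:
    """True if the envelopes collide, i.e. the gap is at most 2 * margin."""
    return literal_collision(inflate(a, margin), inflate(b, margin))
```

With a margin of zero the envelopes are identical to their objects, and the
near-collision test reduces to the literal collision test, as described above.
Nothing in this sketch addresses perceptual collisions.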

Circuit schematics and most types of maps are much more like CMN in that
they are intended for human consumption, so that the only criterion is
readability. This is, of course, a function of the human pattern-recognition
mechanism, and likely has no rigid rules whatever, as I have already argued (in
Section 2.5, “Counterexamples and ‘Fully Automatic High Quality Music
Notation’”). This makes the problem easier in some ways, but almost certainly
harder overall. I will go into more detail on this question in Sec. 5.24.5.

Figure 1. Perceptual collisions that are not literal collisions

Figure 2. Collision avoidance in text setting

Figure 3. A semi-image-space approach to collision avoidance in text setting

Avoiding collisions of any type can best be thought of as having two totally 
independent components: detecting collisions in a proposed layout, and resolving 
them by moving one or more of the symbols involved. Literal collisions can be 
detected relatively easily. The problem of detecting them arises also in 
“hidden-surface” algorithms, that is, algorithms used in displaying projections of 
three-dimensional scenes on conventional “flat” displays. These algorithms work by 
computing what parts of objects are hidden by other objects closer to the 
observer’s eye and are therefore invisible. A tremendous amount of work has 
been devoted to such algorithms, and they are now rather well understood.[1] 
Identifying perceptual collisions, however, is not relevant to hidden-surface 
algorithms and is not nearly as well understood. 

The second component of the problem, resolving collisions in proposed 
layouts, appears to be still less understood.[2] Which symbol or symbols should 
be moved, and where should it/they be moved to?[3] This subproblem apparently 
has never been attacked in anything like a general way for either type of collision. 
Not surprisingly, there are important differences in how collisions can be resolved, 
but the differences appear unrelated to the literal/perceptual dichotomy. For 
example, in circuit schematics, any component can be translated at least a little in 
any direction, and many can be rotated arbitrarily. In CMN, on the other hand, 
most symbols can be translated along only one axis, and none can be rotated at 
all except for some rather trivial cases (symbols that have radial symmetry and 
whose rotation is therefore invisible, and certain note modifiers (Sec. 2.3.6.1) that 
can be rotated by 180 degrees); in this respect CMN is harder to deal with. These 
differences are substantial enough that none of the large amount of work on 
automatic placement in computer-aided design seems applicable to CMN. (For a 
discussion and bibliography, see [KIRK83].) 

I will now discuss examples of both literal and perceptual collision avoidance, 
with attention to representation, then consider how all this applies to CMN. 

5.2.1.1. One-Dimensional Collision Avoidance in Typesetting of Text 

Some one-dimensional formatting problems related to collision avoidance 
occur in typesetting of text. Despite the simple parameters — only one dimen- 
sion and literal, not perceptual, collisions — they are not completely trivial. 
Consider the spacing required to print the pairs of letters “AV” and “Wo” as 
opposed to, say, “AN” and “Wh”: Fig. 2a shows all of these as set by 
TROFF [OSSA70, KERN78b] (more precisely, by ITROFF, the Imagen laser 
printer version of TROFF) in Knuth’s Computer Modern font [KNUT82]. 
The problem here is what is called kerning. To make these look correct to the 
sensitive eye, most experts agree that one must take into account the shapes 
of the characters [BIEG70]; when “A” is followed by “N” it requires more 
room than when it is followed by “V”, since in the latter case the two 
characters can share some space (Fig. 2b). “Kerning” is the term for this sharing 
of space, but the concept generalizes naturally to include the opposite situation, 
i.e., the situation where allocating each character the obvious amount of space 
results in consecutive characters being too close (Fig. 2c, again set by 
TROFF). Incidentally, note that the “IT” in Figure 2c (as well as in the text 
just now) is really a single character, called a ligature. In English, only a 
handful of ligatures are used, and they do not complicate kerning significantly. 

Many typesetting systems have an automatic kerning capability; TROFF 
does not, although one can accomplish it by manually overriding TROFF’s 
positioning. Automatic kerning is ordinarily done by table lookup on pairs of 
characters. A pure object-space (see Sec. 3.3.0) approach like this is hopeless 
for CMN and, in fact, does not work that well even for text, since it requires 
large, mostly empty, tables for use within each font. Worse, kerning is 
sometimes needed across fonts, as, for example, in Fig. 2c, where italic characters 
adjoin roman punctuation marks, and supplying a table for every possible pair 
of fonts would be a ridiculous overkill. Few if any text formatters attempt to 
kern across fonts automatically. TROFF again does not, and, for this 
dissertation, it had to be corrected manually in several instances (Fig. 2d). A new 
“semi-image-space” approach, recently developed by Electronic Information 
Technology, divides each character into eight zones as shown in Fig. 3. In 
most typesetting systems, the size of each character in a given font is 
described by a single number giving its overall width; there are kerning tables 
of the type I have described, but no explicit information about the shapes of 
the characters. In the EIT system, however, the size information for each 
character includes eight additional numbers, each of which says how much 
space on the outside of its horizontal “half slice” is vacant. This information, 
which essentially gives the shapes of the left and right edges of the character 
with very low resolution, is used in a straightforward way for kerning. For 
text, this is in several respects a better solution than kerning tables. It 
automatically works across fonts just as well as within them. It can easily be 
parameterized to suit a typographer's taste: varying a single number can pro- 
duce any amount of kerning from none to enough to make the characters 
touch. (This is something like varying the size of the envelope in the literal 
near-collision model.) More important for our purposes, however, although this 
representation is one-dimensional, it does not appear hard to extend to the 
two-dimensional domain of CMN, for example by describing the top and bot- 
tom edges of the symbols as well as the left and right ones. 
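
The zone scheme just described can be sketched as follows. This is an illustration under stated assumptions, not the EIT implementation: it assumes the eight zones are four horizontal slices, each with a left and a right vacancy figure, and the glyph measurements are invented for the example. Glyph, kern, and advance are hypothetical names.

```python
from dataclasses import dataclass
from typing import Tuple

SLICES = 4  # assumption: four horizontal slices, each with a left and right zone


@dataclass(frozen=True)
class Glyph:
    width: float                     # overall set width of the character
    left_vacant: Tuple[float, ...]   # empty space at the left edge, top slice first
    right_vacant: Tuple[float, ...]  # empty space at the right edge, top slice first


def kern(first: Glyph, second: Glyph, amount: float = 1.0) -> float:
    """Distance by which `second` may be pulled toward `first`.

    amount = 0 gives no kerning; amount = 1 closes the gap until the two
    characters touch in their tightest slice.
    """
    slack = min(r + l for r, l in zip(first.right_vacant, second.left_vacant))
    return amount * slack


def advance(first: Glyph, second: Glyph, amount: float = 1.0) -> float:
    """Horizontal distance from the origin of `first` to the origin of `second`."""
    return first.width - kern(first, second, amount)


# Invented measurements: "A" is empty at the top corners, "V" at the bottom
# corners, and "N" fills its whole width in every slice.
A = Glyph(10.0, (4.0, 3.0, 1.0, 0.0), (4.0, 3.0, 1.0, 0.0))
V = Glyph(10.0, (0.0, 1.0, 3.0, 4.0), (0.0, 1.0, 3.0, 4.0))
N = Glyph(10.0, (0.0, 0.0, 0.0, 0.0), (0.0, 0.0, 0.0, 0.0))

print(advance(A, V))        # "AV" can share space
print(advance(A, N))        # "AN" cannot
print(advance(A, V, 0.5))   # half the available kerning
```

Nothing in the computation refers to a font, which is why the same procedure works across fonts as well as within them; the single parameter `amount` plays the role of the typographer's taste.
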

5.2.1.2. Perceptual Collision Avoidance in Cartography 

A recent paper by Hirsch [HIRS82] describes an algorithm for “automatic 
name placement around point data”. Hirsch says, “Three basic map features 
annotated by names are points, lines, and areas. The placement of names for 
these features is governed by the principle that the name and its object should 
be easily recognized.” Hirsch is concerned solely with placing names on 
features of the first type so that they do not collide perceptually with other 
names or with other features of that type. He cites the opinions of various 
authorities on what relative positions of point and name are most desirable, 
and observes that, in the process of placing names around point data manu- 
ally, 

three general procedural phases can be identified. First, a given set of names 
is selected from a gazetteer. Second, during a layout phase, names are itera- 
tively placed in an attempt to find an overall solution. For instance, the car- 
tographer might start out with several promising name configurations. As he 
proceeds, it may be necessary to rearrange some of the names already placed, 
choose a smaller letter size, or exclude some names. In this phase, the 
general procedure is to work on the lettering draft from an area of symbol 
density outwards placing the larger names first, then the smaller. Given an 
acceptable layout, the final phase is the actual placement. 

Hirsch's algorithm assumes that names are always written horizontally. It is 
designed to place the names so that they do not overlap, and so that each 
clearly refers to its point symbol. It attempts to do this by (1) requiring that 
the distance from any labelled point to its name be a constant, in other words, 
that the "name" (presumably, the closest point of the name) lie on a predeter- 
mined circle around its point; and (2) not permitting any part of any other 
name to enter a point's circle. Thus, the major variable is the direction from 
each point to its name. In addition, if the direction is 0, 90, 180, or 270 
degrees, the name can slide along a tangent to the circle at that angle. Hirsch 
begins by putting each name in its preferred position, then looking for over- 
laps. I could go into more detail, but it should be clear by now that this pro- 
cedure is so specific to its domain as to be of no value at all for CMN. Hirsch 
references a thesis on automatically positioning feature names on maps, which 
might be more relevant, but I was unable to obtain a copy. 

Hofstadter [HOFS83a] has suggested a perceptual collision that might 
occur on a map that would take significant intelligence to detect. If a 
hyphenated name were to appear in large type on a map, the hyphen might 
appear to underline another name in much smaller type. 

5.2.1.3. Representations for Collision Avoidance 

Perhaps the most obvious representation of a page for collision avoidance is 
with a bit-matrix, simply indicating (with appropriate resolution) whether 
each "point" on the page is occupied or not. This seems especially appropri- 
ate with raster scan displays, since such displays already have available just 
this information (the resolution being, of course, one bit per pixel). However, 
in order to be able to decide what to do when a collision is detected, we really 
need to know what symbol each occupied point is part of. This is certainly 
feasible with some frame-buffer systems that have many bits per pixel.[4] To 
actually describe the image, we need only one bit per pixel (CMN is rarely 
printed with either color or gray scale!), so the remaining bits are free for the 
symbol descriptor.[5] 
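
A minimal sketch of such a page image follows. It assumes each cell stores a small integer identifying the symbol that occupies it, with 0 meaning empty; the class OccupancyGrid and the helper rect_pixels are hypothetical names introduced only for this illustration.

```python
from typing import Iterable, Iterator, List, Set, Tuple

EMPTY = 0  # cell value meaning "no symbol here"


def rect_pixels(x: int, y: int, w: int, h: int) -> Iterator[Tuple[int, int]]:
    """Yield the (x, y) cells of a w-by-h rectangle with corner (x, y)."""
    for yy in range(y, y + h):
        for xx in range(x, x + w):
            yield xx, yy


class OccupancyGrid:
    """Page image in which every cell records the id of the symbol covering it."""

    def __init__(self, width: int, height: int) -> None:
        self.cells: List[List[int]] = [[EMPTY] * width for _ in range(height)]

    def place(self, symbol_id: int, pixels: Iterable[Tuple[int, int]]) -> Set[int]:
        """Draw a symbol; return the ids of symbols it literally collides with.

        Cells already owned by another symbol keep their earlier owner.
        """
        if symbol_id == EMPTY:
            raise ValueError("symbol ids must be nonzero")
        collided: Set[int] = set()
        for x, y in pixels:
            owner = self.cells[y][x]
            if owner not in (EMPTY, symbol_id):
                collided.add(owner)
            else:
                self.cells[y][x] = symbol_id
        return collided


page = OccupancyGrid(10, 10)
print(page.place(1, rect_pixels(0, 0, 3, 3)))   # nothing there yet
print(page.place(2, rect_pixels(2, 2, 3, 3)))   # overlaps symbol 1 at one cell
```

The point of storing an id rather than a single bit is visible in the return value: the caller learns not only that a collision occurred but which symbols are involved, which is what a resolution step needs.
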

Unfortunately, the advantage of the bit-matrix data structure disappears 
on any nonraster device, and even on raster devices where the frame buffer is 
not “memory-mapped”, that is, where it is not directly addressable by the 
processor. An approach that maintains device independence and also looks 
reasonably efficient is to keep the same information with a different data 
structure, namely the quad tree. 

Quad trees were first used in the late 1960s, for example in a hidden-surface 
algorithm developed by Warnock [WARN69] (although the term “quad 
tree” is still not used universally). They are now used extensively in several 
fields of graphics, for example animation [HUNT78] and cartography 
[WEBB83]. Hunter studied the quad tree extensively; he defines one 
[HUNT78, p. 51] as “a tree whose nodes are either leaves or have four 
children.” Thus, as the name suggests, they are quaternary trees. As used in 
computer graphics, quad trees recursively subdivide the (two-dimensional) 
picture in question into smaller and smaller rectangular quadrants, with leaves 
most commonly representing areas of uniform value (conventionally called 
“color”, although it may be any information associated with a region) (Fig. 4). 
Thus, quad trees represent the picture with variable resolution: high in regions 
where it changes rapidly, low where it doesn’t. As is intuitively obvious, they 
are efficient in terms of memory requirements for most applications; it is less 
obvious but equally true that they can be efficient in terms of computation 
time. Fast algorithms are known for many operations on quad trees, e.g., 
superimposing regions represented by them and finding neighboring regions 
[HUNT78], and for converting between them and such other representations as 
polygons [HUNT78], boundary codes [DYER80, SAME80], and rasters (i.e., 
bit-maps) [SAME81]. Incidentally, even with memory-mapped frame-buffer 
devices, quad trees may be preferable to direct use of the frame buffer, since 
they may improve efficiency in two ways: (1) by making it possible to 
determine very quickly whether or not a large region is free, and (2) by making the 
resolution the program deals with independent of the resolution of the 
hardware (this can improve efficiency because lower-than-hardware resolution 
will probably work quite well in most cases, and execution time for most 
algorithms is of course an increasing function of resolution). 
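
The following sketch shows a region quad tree used to answer the question “is this region free?”. It is a simplified illustration, not one of the algorithms from the cited literature: it assumes a square page whose side is a power of two, integer coordinates, and a leaf value of 0 for free space. The class and method names are hypothetical.

```python
class QuadTree:
    """Region quad tree over the square [x, x+size) x [y, y+size).

    `size` must be a power of two and all coordinates integers.
    A leaf stores one value for its whole square; 0 means "free".
    """

    def __init__(self, x=0, y=0, size=8, value=0):
        self.x, self.y, self.size = x, y, size
        self.value = value
        self.children = None  # None marks a leaf

    def _relation(self, x0, y0, x1, y1):
        """Classify this square against the rectangle [x0, x1) x [y0, y1)."""
        x_end, y_end = self.x + self.size, self.y + self.size
        if x1 <= self.x or y1 <= self.y or x0 >= x_end or y0 >= y_end:
            return "outside"
        if x0 <= self.x and y0 <= self.y and x1 >= x_end and y1 >= y_end:
            return "inside"
        return "partial"

    def fill(self, x0, y0, x1, y1, value):
        """Set every cell of the rectangle to `value`."""
        if x0 >= x1 or y0 >= y1:
            return  # empty rectangle: nothing to do
        relation = self._relation(x0, y0, x1, y1)
        if relation == "outside":
            return
        if relation == "inside":
            self.value, self.children = value, None
            return
        if self.children is None:
            if self.value == value:
                return
            half = self.size // 2
            self.children = [QuadTree(self.x + dx, self.y + dy, half, self.value)
                             for dy in (0, half) for dx in (0, half)]
        for child in self.children:
            child.fill(x0, y0, x1, y1, value)
        first = self.children[0]
        if all(c.children is None and c.value == first.value
               for c in self.children):
            self.value, self.children = first.value, None  # merge uniform quadrants

    def is_free(self, x0, y0, x1, y1):
        """True if no cell of the rectangle holds a nonzero value."""
        if self._relation(x0, y0, x1, y1) == "outside":
            return True
        if self.children is None:
            return self.value == 0
        return all(c.is_free(x0, y0, x1, y1) for c in self.children)


page = QuadTree(0, 0, 8)
page.fill(0, 0, 4, 4, 7)            # symbol 7 covers the first quadrant
print(page.is_free(4, 4, 8, 8))     # answered by a single large leaf
print(page.is_free(3, 3, 5, 5))     # touches symbol 7
```

A query over a large empty region stops at the first uniform leaf that contains it, which is the first of the two efficiency gains mentioned above.
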

Note that both bit-matrix and quad-tree representations are of the 
image-space variety (Sec. 3.3.0). It makes sense to consider such representations, 
since the problems we are trying to solve are purely on the visual level. 

5.2.1.4. Perceptual Collisions and Artificial Intelligence 

As early as Sec. 1.3.1 of this dissertation, I expressed the opinion that 
practical results in computer setting of music could be achieved in the near future, 
but only by avoiding the AI techniques which, as I argued in Sec. 2.5, are vital 
for “Fully Automatic High Quality Music Notation”. Nowhere is this more 
true than in avoiding perceptual collisions. 

As I said in Sec. 2.5, making decisions in music setting can involve both 
common sense and the (local and global) semantics of the music to an 
arbitrary extent. I cited there a number of examples of literal collisions in music 
from well-known publishers, as well as some things that are more relevant to 
the current discussion: examples of symbols interrupted to avoid collisions 
and of symbols written with nonstandard shapes to avoid collisions. How 
could a system (whether human or computer) decide to do these things? 
Obviously, only by comparing the possible alternatives (all of which, in 
many cases, violate the explicit conventions of CMN in some way!) and 
deciding what can most easily “slip”, to use Hofstadter’s term. Hofstadter 
argues that such decisions are the key element of human intelligence 
[HOFS83b]: 

When faced in life with a complex decision to make, with pressures being 
exerted on us in various directions, often we find that something in our mind 
“gives”, in the sense of yielding under pressure. We do not consciously choose 
what will give; rather, we discover, to our surprise, that something simply has 
given. In many such situations the intensity of the pressures is small enough 
that the decision is not crucial; in them, the same kind of yielding or giving 
takes place, but because the stakes are less, we often do not notice, either 
during the decision or after it, that there was a conflict to resolve or that it 
was resolved by something’s yielding. 


These kinds of situations exemplify what I call mental “slipping”: the 
conversion of one concept into another related one under pressure . . . slippage is the 
mechanism, not just a mechanism, by which thought takes place in time in a 
mind/brain. The determination of what actually slips is the result of the 
differential slippabilities of the various facets of a mental representation of a 
situation. These slippabilities are not explicit numbers, but qualities 
dependent on the entire interwoven structure, and they emerge only when the 
entire cognitive structure is stressed from the outside, i.e., pushed or pulled or 
otherwise disturbed, and forced to yield. 

See also [HOFS79, pp. 633-40, 641-80] and [HOFS82b]. 

5.2.1.5. Avoiding the Entire Problem of Collision Avoidance 

The cases we have been considering suggest the hypothesis that the 
perceptual-collision avoidance problem is so hard that workers have 
completely avoided the problem itself. The few attacks that have been made on it 
have been so dependent on domain-specific knowledge that it is hard to see 
how they could be adapted to other areas. In music setting, I am not aware of 
anyone thus far who has failed to avoid the problem; that is, everyone (to my 
knowledge) has sidestepped it. (Additional evidence of the difficulty of the 
problem is given by my Counterexamples (Sec. 2.5). It is obvious in many of 
them that the need to avoid collisions is what drove the composer or editor to 
employ idiosyncratic notation.) 

SMUT exemplifies one way of avoiding the problem: it does not even have 
a mechanism for detecting collisions, whether perceptual or literal, but it 
nonetheless attempts to prevent them, simply by following rules of thumb for 
the placement of various symbols. For example, groupet auxiliary numerals 
are positioned vertically just outside the range occupied by noteheads and 
stems of the notes in the group, so that they cannot collide with the noteheads 
or stems. However, this does not guarantee that an auxiliary numeral will not 
collide with something else, say a slur or character string; it may actually 
cause such a collision by positioning the numeral farther away from the notes 
than it needs to be. In fact, SMUT as currently implemented cannot look for 
and avoid collisions, because it has no “mental image”, that is, no image-space 
representation, of what the page it is constructing looks like. I have already 
discussed (in Sec. 5.2.1.3) how such an image might be maintained, both in 
terms of data structures and in terms of the information contained in them. 

5.2.1.6. Collision Resolution in CMN 

Suppose now that we have an algorithm for detecting some type of 
collision in CMN. Naturally, we would much prefer to handle perceptual collisions, 
but that is rather a tall order, and (as a first approximation) the ability to deal 
with literal collisions would probably be useful. Certainly part of the 
representation involved would be information about the shapes of the symbols, 
as suggested in Sec. 5.2.1.1. In any case, what domain-specific knowledge is 
available to help us decide how to resolve collisions, and how should it be 
used? Of obvious importance are the permissible translations (in the 
geometric sense) of CMN symbols. But we really need to know, not just 
whether it is permissible to move a clef horizontally or vertically, 
but, assuming it collides with a character string, whether it is more 
desirable to move the clef or the string. A first attempt at giving this information 
appears in Table 1. The larger the number a type of symbol has in the 
“Vertical slippability” column, the less undesirable it is to move a symbol of that 
type in that axis. “0” means the symbol type has no flexibility and should 
not be moved in that axis, no matter what the circumstances. The same holds 
for the “Horizontal slippability” column. The numbers given are rough 
guesses and could undoubtedly be refined. 

With this information, a basic collision-resolution procedure, on which 
many variations and improvements are possible, is obvious. (For example, one 
might have separate numbers for up and down motion instead of a single 
vertical motion number, and similarly for left/right motion instead of horizontal. 
Or one might define the numbers in the manner of TeX “glue” as in some way 
limiting the distance the symbol can move, allowing larger displacements but 
with an appropriate penalty scheme to discourage them.) At the same time, I 
hope it is also obvious, considering all I have said, that the question of 
“slippability” is an extremely deep one that cannot possibly be satisfied in general by 
comparing a few numbers: shallow collision avoidance schemes like this one 
may certainly work better than none at all, but they have no hope of solving 
all the possible sets of conflicting requirements in CMN. 
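
The basic procedure can be sketched in a few lines. This is one possible reading of it, not a documented algorithm: given two colliding symbol types and one axis, the type with the larger slippability yields, and if both are 0 the collision cannot be resolved on that axis. The three vertical values below are the ones given for those items in Table 1; the function name and the tie-breaking rule are assumptions.

```python
from typing import Dict, Optional

# Vertical slippability for three symbol types, as listed in Table 1.
VERTICAL: Dict[str, int] = {
    "clef": 0,
    "barline": 0,
    "character string": 6,
}


def symbol_to_move(kind_a: str, kind_b: str,
                   slippability: Dict[str, int]) -> Optional[str]:
    """Return the symbol type that should yield on this axis, or None.

    The type with the larger slippability is moved. A value of 0 means
    "never move on this axis", so two zeros leave the collision unresolved.
    Ties between nonzero values go to the second symbol (an arbitrary choice).
    """
    a, b = slippability[kind_a], slippability[kind_b]
    if a == 0 and b == 0:
        return None
    return kind_a if a > b else kind_b


print(symbol_to_move("clef", "character string", VERTICAL))
print(symbol_to_move("clef", "barline", VERTICAL))
```

For the clef and character string of the example in the text, the string moves. The second call shows the scheme's limit: when neither symbol may move, comparing numbers gives no answer at all.
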

TABLE 1 

Item                                      Vertical slippability 

Noteheads, stems                          0 
Augmentation dots                         2 
Accidentals                               0 
Rests                                     8 
Note modifiers                            4 
Beams (affect stem endpoints) 
Slur endpoints                            S 
Tie endpoints                             1 
Groupet accessory numerals and brackets   5 
Character strings                         6 
Barlines and repeat signs                 0 
Clefs                                     0 
Key signatures                            0 
Time signatures                           0 
Grace notes                               0 
Octave signs                              7 
Glissandi                                 0 
Pauses                                    3 

Horizontal slippability and variable aspects 

6 
7 
4 
5 

LO 
3 S 
2 L 
1 LO 
9 

8 

L 
LO 

Codes for “Variable aspects”: L: length; O: orientation; S: shape. 

5.2.2. Symbol Placement by Simulated Annealing 

A recent paper by Kirkpatrick et al [KIRK83] presents analogies between 
statistical mechanics and combinatorial optimization that appear to me very 
promising for placing symbols in music notation. The paper’s summary says 

There is a deep and useful connection between statistical mechanics (the behavior 
of systems with many degrees of freedom in thermal equilibrium at a finite tem- 
perature) and multivariate or combinatorial optimization (finding the minimum 
of a given function depending on many parameters). A detailed analogy with an- 
nealing in solids provides a framework for optimization of the properties of very 
large and complex systems. 

Annealing is the process of altering the physical structure of a solid by “heating 
[it] to and holding [it] at a suitable temperature and then cooling [it] at a suitable 
rate for such purposes as reducing hardness, improving machinability, facilitating 
cold working, producing a desired microstructure, or obtaining desired 
mechanical, physical, or other properties.” [AMER61] More specifically, the type of 
annealing to which Kirkpatrick et al refer involves melting the solid, then 
lowering its temperature slowly, spending a long time at temperatures in the vicinity of 
the freezing point. The result of the annealing may be thought of as minimizing 
the value of some function of the object. In simulated annealing, computations 
are done on some datum or set of data in order to minimize a given function of 
the data, usually called the “cost function”. Clearly, the process also involves 
analogs of temperature and of the concepts of melting and freezing. In our case, 
the printed music is the solid; the amount of freedom the symbols on the page 
have to move around independently is temperature, and melting means changing 
from no such freedom to some freedom; and the cost function is one that 
describes numerically the “badness” of the particular arrangement of symbols on 
the page. Obviously, an important ingredient of such a function will be an 
estimate of the seriousness of any collisions involved, not an easy matter, as we 
have seen. However, given a badness measure, “simulated annealing” looks very 
promising as a technique for improvement. Referring to attacks on NP-complete 
problems, which frequently require prohibitive computational effort for exact 
solutions, Kirkpatrick et al write: 

There are two basic strategies for heuristics: “divide-and-conquer” and iterative 
improvement. In the first, one divides the problem into subproblems of 
manageable size, then solves the subproblems. The solutions to the subproblems must 
then be patched back together . . . In iterative improvement, one starts with the 
system in a known configuration. A standard rearrangement operation is applied 
to all parts of the system in turn, until a rearranged configuration that improves 
the cost function is discovered. The rearranged configuration then becomes the 
new configuration of the system, and the process is continued until no further 
improvements can be found. Iterative improvement consists of a search in this 
coordinate space for rearrangement steps which lead downward. Since this 
search usually gets stuck in a local but not a global optimum, it is customary to 
carry out the process several times, starting from different randomly generated 
configurations, and save the best result. 

(Although Kirkpatrick et al do not say so, presumably the process is always 
restricted to syntactically valid configurations; there would be no point to allowing 
invalid configurations. Also, in setting music, where the cost function is 
tremendously complex, it would be sensible to start, not with any random legal 
configuration of the symbols, but rather with a configuration generated by 
methods like SMUT’s.) The essential difference between iterative improvement 
and simulated annealing is that the former is deterministic while the latter is not. 
In both processes, a step (that is, a rearrangement) that improves the cost 
function is always accepted. In annealing, however, a rearrangement that hurts the 
cost function may still be accepted, depending on a random variable. The 
rationale for this is to avoid getting stuck in a local optimum, to accept local 
worsening in hope of global improvement, to allow breaking “logjams”. The 
annealing process accepts or rejects steps with the following procedure: let ΔC be the 
change in the cost function produced by that step and let T be the current 
temperature. Then if ΔC ≤ 0 the step is always accepted, while if ΔC > 0 the 
probability of accepting it is exp(-ΔC/T). Thus the probability of a locally harmful step 
being accepted decreases as the amount of harm increases, but it increases as the 
temperature increases. Note particularly that at zero temperature no harmful 
changes are accepted. The effect might be thought of, in Hofstadter’s 
terminology, as parallel slipping of many things, using some randomness to help decide 
what slips are accepted. Kirkpatrick et al argue convincingly for the superiority 
of annealing: 

Annealing . . . differs from iterative improvement in that the procedure need not 
get stuck since transitions out of a local optimum are always possible at nonzero 
temperature. A second and more important feature is that a sort of adaptive 
divide-and-conquer occurs. Gross features of the eventual state of the system 
appear at higher temperatures: fine details develop at lower temperatures. 
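
The acceptance rule described above can be written down directly. The sketch below pairs it with a geometric cooling schedule, which is an assumption made here for brevity and is not prescribed by the text; the parameter names, default values, and the toy one-dimensional cost function are likewise illustrative only.

```python
import math
import random


def accept(delta_cost: float, temperature: float, rng: random.Random) -> bool:
    """Improving steps always pass; harmful ones pass with probability exp(-dC/T)."""
    if delta_cost <= 0:
        return True
    if temperature <= 0:
        return False  # at zero temperature no harmful change is accepted
    return rng.random() < math.exp(-delta_cost / temperature)


def anneal(state, cost, neighbor, t_start=10.0, t_end=0.01,
           cooling=0.95, steps_per_t=100, seed=0):
    """Return the best state found and its cost.

    `cost(state)` measures badness; `neighbor(state, rng)` proposes one
    rearrangement. The temperature is multiplied by `cooling` after each
    batch of steps (an assumed schedule).
    """
    rng = random.Random(seed)
    current, current_cost = state, cost(state)
    best, best_cost = current, current_cost
    t = t_start
    while t > t_end:
        for _ in range(steps_per_t):
            candidate = neighbor(current, rng)
            candidate_cost = cost(candidate)
            if accept(candidate_cost - current_cost, t, rng):
                current, current_cost = candidate, candidate_cost
                if current_cost < best_cost:
                    best, best_cost = current, current_cost
        t *= cooling
    return best, best_cost


# Toy problem: slide one symbol along a line toward position 3.
best, best_cost = anneal(
    20,
    cost=lambda x: (x - 3) ** 2,
    neighbor=lambda x, rng: x + rng.choice((-1, 1)),
)
print(best, best_cost)
```

Following the suggestion in the text, the initial `state` for music would be a layout produced by rule-of-thumb methods rather than a random one, and `cost` would include some estimate of the seriousness of each collision.
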

5.2.3. Texture and the Importance of Stupidity 

Severo Ornstein, in speaking of his and John Maxwell's Mockingbird system, 
made the thought-provoking comment [ORNS82]: “We were very careful to build 
a ‘stupid’ system first into which ‘smartness’ could be added later. That allowed 
us to finish -- and handle (piano) scores of more or less arbitrary complexity.” 
This view of stupidity as desirable because it promotes flexibility is, of course, 
much different from the usual one that a system should be as smart as possible so 
as to relieve its users from the need to specify low-level details of no interest to 
them. In fact, however, these two viewpoints dovetail nicely. The resolution lies, 
first, in the observation that Ornstein's statement is really emphasizing the 
desirability of modularity of levels of control, and second, in the attitude I express 
in [BYRD80] and Sec. 5.3 below, that a complex system should let users work at 
any level and change levels very easily. 

Seen from Ornstein’s perspective, SMUT suffers from a severe case, not of 
over-smartness, but of overconfidence in its smartness: its high-level machinery 
depends heavily on certain assumptions about CMN that are incorrect for a great 
deal of music — perhaps most. The most important way in which this is true 
has to do with texture (see Sec. 2.3.0) and its effect on symbol placement. 

SMUT has built into it the assumptions that each staff will have a constant 
number of voices, that that number is no more than two, and that each voice 
belongs to a single staff.[6] As I have suggested before (e.g., Sec. 1.4), all three of 
these assumptions are violated constantly in keyboard music, and occasionally in 
music for other instruments. The number of voices on a typical staff of keyboard 
music varies continually, often reaching three, and voices frequently cross from 
one staff to another. On the other hand, SMUT’s low-level machinery could 
profitably be used on virtually all CMN if only it could be accessed without 
interference from the high-level machinery. [7] 

SMUT is not the only music-setting system with limited understanding of 
texture; in fact, I know of no system that attempts to handle automatically more 
than two voices per staff, a varying number of voices per staff, or voices shared 
between staves. There is good reason for this: as I pointed out in Sec. 2.3.0.2 
and 2.4, multiple voices per staff and interdependence between staves lead to 
great complexity. This emphasizes again the desirability of easy level switching. 

It is probably safe to say that the major unsolved problems of automatic 
music setting are the two aspects of symbol placement we’ve just discussed, 
namely collision avoidance and handling of complex textures. Neither is likely to 
be solved in a very satisfactory way any time soon. This is especially true 
because the two are closely interrelated: complex textures are much more likely to 
lead to problems with collisions than are simple ones. See Sec. 5.8. 

5.2.4. Modularity of Knowledge and Single-Character Placement 

The issue of modularity of knowledge relates to some symbol-placement 
problems that are much simpler than those of collision avoidance. A very common 
problem in all kinds of formatting systems is that knowledge of how much space 
various symbols occupy is stored in more than one place, so that it is possible for 
different parts of the system to behave inconsistently. TROFF, for one, appears 
to suffer from this problem. The version of TROFF available to me when I began 
working on this dissertation had problems spacing text properly, but set the 
mathematics of Figs. 48 and 47 of Chapter 2 correctly. A later version did better 
with text, but much worse on those figures. A few instances of improper spacing 
of text still show up in this dissertation: for example, look at the “isio” in the 
word “collision” in any of the section headings of this chapter. TeX also suffers 
from the problem, as does SMUT. The problem is a curious one in that, when 
symbols are drawn by software, it is (in principle, at least) easy to solve; when 
they are drawn by hardware, it is more or less unsolvable, since no part of the 
software really knows how much space the symbols need, but is also less likely to 
cause trouble, since symbol space requirements are likely to change far less often. 

Note, incidentally, the relationship of this problem to kerning and to collision 
avoidance: this is a local, one-symbol-at-a-time problem, while kerning involves 
two symbols at a time, ligatures (in English) involve two or three, and collision 
avoidance, at least in CMN, any number. But this problem is also basic to 
kerning and collision avoidance in that if a system does not have accurate 
knowledge of the space requirements of individual symbols, it cannot possibly 
handle other symbol placement problems. 

Clearly, even if the symbols are drawn by software, the only routine that 
really knows how much space a symbol occupies is the one that draws it. In 
SMUT, all of the symbol-drawing routines are in Pass IV; but in order to do 
punctuation, Pass III needs to know how much space the symbols occupy, at least 
horizontally (the current implementation concerns itself with vertical space 
requirements only in a very crude fashion). Subroutine CALCXY answers ques- 
tions from JUSTIF about horizontal space requirements. In addition, there are a 
few cases where Pass IV routines do small-scale, local, vertical positioning and 
therefore need to know how much vertical room other Pass IV routines will need 
for their symbols. 

Currently, routines in SMUT have built into them assumptions about the
space requirements of symbols drawn by other routines, mostly via DATA
statements. A much better way to handle this would be to ask a symbol-drawing
routine how much space a given symbol will occupy. This could be done simply by
adding two parameters to the calling sequence of each such routine, one input to
specify “print” or “return information”, and one output in which to return the
spacing information.
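
The proposed calling convention can be sketched as follows. This is only an illustration, written in Python rather than SMUT's FORTRAN; every name in it (draw_symbol, Extent, total_width, the symbol table and its dimensions) is hypothetical and not taken from SMUT. Because Python can return a value directly, the "output parameter" becomes the function's return value.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Extent:
    """Space a symbol occupies, in arbitrary plotter units."""
    width: float
    height: float


# Hypothetical symbol table; the dimensions are made up for illustration.
_SYMBOL_SIZES = {
    "quarter_note": Extent(width=1.2, height=3.5),
    "sharp": Extent(width=0.9, height=2.8),
}


def draw_symbol(name, x, y, mode="print", plotted=None):
    """Draw a symbol or, in query mode, only report its space needs.

    mode: "print" to draw, "query" to return information without drawing.
    plotted: a list standing in for the plotter; draw commands are appended.
    The Extent is returned in both modes, so the drawing routine remains
    the only place that knows how large the symbol is.
    """
    if mode not in ("print", "query"):
        raise ValueError(f"unknown mode: {mode!r}")
    extent = _SYMBOL_SIZES[name]
    if mode == "print" and plotted is not None:
        plotted.append((name, x, y))
    return extent


def total_width(names, gap=0.5):
    """A justification-style caller: asks for widths and draws nothing."""
    widths = [draw_symbol(n, 0.0, 0.0, mode="query").width for n in names]
    return sum(widths) + gap * max(len(names) - 1, 0)
```

A caller such as total_width never needs a private table of symbol sizes, so a change to a symbol's shape has to be made in one place only.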

5.3. EDITING AND FORMATTING, DESIGN AND DRAFTING

The work of preparing visual information in many disciplines, including music, for
presentation to others is commonly divided into “editing” and “formatting”. Two
recent surveys of editing and formatting, respectively, that appeared in a single
journal issue are those of Meyrowitz and Van Dam [MEYR82] and Furuta et al.
[FURU82]. Each paper mentions work on various types of information but
concentrates on text[8], and each acknowledges the relevance of the other area.
Furuta et al. define two types of hierarchies of objects, one composed of “abstract”
objects (in text, items like characters, “logical” words which might appear in
dictionaries, paragraphs, sections, headers) and one of “concrete” objects (in text,
items like representations of characters in a particular font, two-dimensional words
with possible hyphenation, and lines). “A document is an object composed of a
hierarchy of more primitive objects”, presumably (though they do not say so) either
all abstract or all concrete. In this terminology, editing operations are mappings
from abstract objects to abstract objects or concrete objects to concrete objects.
Formatting operations are mappings from abstract objects to concrete objects.
Meyrowitz and Van Dam take the interesting position that formatting is really an
integral part of every editor, and suggest that multiple formats are becoming more
important.

Steven Johnson has pointed out [JOHN81] a more abstract and general way of
looking at the process, namely in terms of what the user is trying to accomplish, not
what she or he is doing to accomplish it. In this view, the dichotomy is not “editing”
versus “formatting”, but “design” versus “drafting”. (In the other view, it is hard to
classify input as either editing or formatting; here, it is clearly part of design.)
Computer processing of music is one of a class of problems that might be called
“computer-aided design” in a much broader sense than the usual one. The phrase is
ordinarily applied to the automation of the fields that themselves go under the name
“design” (electronic and mechanical design, for example); but computer processing
of music and text, among other things, has many similarities to computer-aided
design. The design/drafting terminology, again, is used only in the “design” fields.
In music, design is called “composing”, and drafting is called “copying” (or
“engraving” or “autography”, etc.); wandering slightly outside of graphics into text
processing, we find the terms are “writing” (supported by text editors) and
“formatting” or “setting”. Whatever terms are used, the essence of the distinction is
that design, in the strict sense, is concerned primarily with semantics and secondarily
with syntax. Drafting, on the other hand, is ideally concerned only with graphics and
possibly syntax (see Sec. 2.4 for terminology) and should under no circumstances
alter the semantics of the material. Thinking more concretely reveals the usefulness
of considering the process as having a third part, what is usually called “editing” but
might be better termed “design revision”. The reason for this distinction is, of
course, that in most real-life situations the design and drafting tasks do not occur
strictly sequentially, but instead one switches back and forth between them more or
less frequently. Ideally there should be no overhead in the switching, so that one can
do it as often as desired without incurring any penalty. (This switching has occurred
in music composition to a very limited extent simply because the nonautomated tools
hitherto available have imposed so much overhead. This should be evident to anyone
who has looked at a typical manuscript of Beethoven, with its multiple scratched-out
versions of a single passage.) In any case, my research has been concerned solely with
the drafting subproblem; however, it is important to realize that an integrated system
that deals with both design and drafting will nearly always be more useful than two
separate systems because it will greatly facilitate the switching.

In fact, in most fields, two types of systems have traditionally been built, but the
types really correspond to a “batch/interactive” dichotomy, not to the
“design/drafting” dichotomy. In a discussion of computer-aided design for
generating integrated circuit mask layout data, Trimberger comments [TRIM81]:

Two primary methods for generating integrated circuit mask layout data are
interactive graphics and layout languages. Each has tasks which it does well and those
which it does not . . . Often the limiting factor in the speed of design is the time it
takes to plot the data. Interactive graphics systems provide “instant plotting”,
enabling the designer to iterate extremely quickly on the design. Interactive graphics
systems also provide a powerful “language” for handling the data. For example, the
user may point to the object of his attention or to a desired position, rather than
search for certain numbers in a program printout or type numbers . . . But
interactive graphics systems do not allow . . . looping constructs [except some that are]
severely limited, conditional geometry or relative positioning. Much more powerful
language-style operations are needed. Layout languages attempt to resolve these
problems . . . Unfortunately, languages specify graphic positions in an awkward
fashion, by numbers . . . Current languages force the user to go through a tedious and
time-consuming edit-compile-plot cycle. Interactive techniques have attempted to
get rid of this lengthy cycle, but have been usually aimed only at the graphic form
and not at the language form.

With minor rewording, these comments apply almost equally well to text processing
or to music editing and formatting. The analogy to music is slightly better because
music usually has more repetition than text.

Even if the design phase is done manually, a computer must obviously have the 
resulting design fed into it before any drafting can be done. In most fields, certainly 
including music, this is so far from trivial that substantial editing is usually required
to correct errors in the input process. Of course, substantial editing can be done 
without a good editor — in particular without one that is good for the problem 
domain. But having such an editor certainly helps, and if one is available, it is hard 
not to think of doing the design work on-line. 

5.3.1. Supporting Multiple Levels of Control

In the paper just cited, Trimberger goes on to discuss his Smalltalk-based
“Sam” system, which attempts to combine the two approaches by maintaining a
data base that is independent of both the graphics and the language form, and
from which the two forms are generated, displayed on the user’s terminal in
separate windows, and continuously updated.

The left side [of the display] shows the program view of the design under edit,
the right side shows the graphics view. The user may move the viewing location
in either window and may make edits to the data in either window. When the
design is changed in either window, the change is reflected immediately in both
windows. The data displayed in the windows are pictures of the data structure.

The data structure is the base form . . . When the user makes what appears to be
a modification of the data in either window, the commands are translated into
calls to procedures in the data structure to carry out the action. The data
structure makes the modification and causes both displays to be updated . . . The data
structure . . . is more than a conventional design automation database, consisting
as it does of objects which have both data and code attributes.
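
The arrangement Trimberger describes, one base data structure from which both displays are regenerated, resembles what is now commonly called the observer pattern. The following Python sketch illustrates the idea only. It is not Sam's code, and the class and method names (Design, add_box, move_box, TextView, GraphicsView) are invented for this illustration.

```python
class Design:
    """Base form: holds the data and notifies every attached view on change."""

    def __init__(self):
        self._boxes = {}   # name -> (x, y, width, height)
        self._views = []

    def attach(self, view):
        self._views.append(view)
        view.refresh(self._boxes)

    def add_box(self, name, x, y, w, h):
        self._boxes[name] = (x, y, w, h)
        self._notify()

    def move_box(self, name, dx, dy):
        x, y, w, h = self._boxes[name]
        self._boxes[name] = (x + dx, y + dy, w, h)
        self._notify()

    def _notify(self):
        # Both "windows" are redrawn from the same data after every change.
        for view in self._views:
            view.refresh(self._boxes)


class TextView:
    """Language form: one line of program-like text per box."""

    def __init__(self):
        self.lines = []

    def refresh(self, boxes):
        self.lines = [f"box {n} at {x},{y} size {w}x{h}"
                      for n, (x, y, w, h) in sorted(boxes.items())]


class GraphicsView:
    """Graphic form: here just the corner positions that would be plotted."""

    def __init__(self):
        self.corners = {}

    def refresh(self, boxes):
        self.corners = {n: (x, y) for n, (x, y, _w, _h) in boxes.items()}
```

An edit made in either window would be translated into a call such as move_box on the Design object, and neither view ever modifies the other directly.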

Trimberger discusses two problems, “expression update” and “iteration
update”, that arise because his system allows changes to be made “in two
different forms which must remain consistent.” Both of the problems occur when
the user has described something at a high level of description, i.e.,
algorithmically, and now modifies an instance of it at a low level, i.e., graphically, or one
might better say, “manually”. This sort of level changing, and either problem,
could occur in editing a piece of music notation. So can the reverse kind of level
change, going from a low level to a high one, as described in the following excerpt
from [BYRD80]:

Now let’s say that a user formats his/her music; then, in order to save vertical
space between staves, makes a low-level change: “move the left end of this beam
5 units right”. (Music engravers actually do this sort of thing.) Now the user
discovers a whole measure missing and inserts it, a much higher-level change
which requires completely repositioning the beam. To do the right thing with it,
the program will need to understand why the user moved the beam to begin
with.

Other problems of going from a low level to a higher one are legion: in the
typesetting world, for example, doing any fancy “manual” alignment at all, then
either changing the wording and reformatting, or switching fonts (and therefore
character widths). In the latter case, both changes are made graphically, but
there is still a change in level of description: now, something has been described
at a moderate level and “manually” modified at a low level, and the user then
wants to specify a change at a high level in the same “neighborhood”. This
question of switching levels is crucial.

Working at a low level must be allowed so that users can “manually" do 
things that the system does not understand how to do. This is clearly of great 
relevance to CMN not only because, as I attempted to show in Secs. 2.5 and 
5.2.1.4, no program will be able to handle CMN totally satisfactorily until the 
entire problem of AI is solved, but more importantly because for a long time to 
come, even many less difficult problems of CMN will be more easily settled by 
human intervention than by automatic methods.[9] Switching levels should be
easy because situations like the one I just described (the example of “going from a 
low level to a high one") will presumably be quite local in any given piece or work 
but may occur in it many times; as a result, the user will want to work at a high 
level most of the time, and switch to a lower level for brief periods but frequently. 

A solution in the text domain to some of the problems of supporting both high
and low levels of control, in many ways similar to Trimberger’s approach, is
found in the Xerox STAR [SEYB81, MEYR82]. A basic problem with the
common “what-you-see-is-what-you-get” text formatting systems (e.g., WORDSTAR)
is that, once they have finished a formatting operation, they do not retain any
memory of the operation’s parameters. Although such a system supports high- as
well as low-level operations on the text, it only retains very low-level information,
namely the current formatted version of the text. In contrast, the STAR text
setting system maintains for each document one or more “property sheets” for each
of several entities: characters, paragraphs, page layout, etc. For example, the
paragraph property sheet includes such information as alignment (flush left, flush
right, or centered), whether to hyphenate, margins, and so on. Each type of
property sheet can change at any point in the document. This provides a record
that describes the formatting of the text and that can itself be edited. The record
is conceptually high level, but, since it can change arbitrarily frequently, any of
the properties can apply to a very small amount of the text, perhaps only a single
character.

5.4. AUTOMATING WHAT HAS ALWAYS BEEN DONE MANUALLY

As Knuth has shown [KNUT79, KNUT81], automatic text formatting by
computer is now advanced enough to give results superior to manual in some ways, and
equal to manual in most. Not only is formatting of music inherently far more
difficult than that of text (as we saw in Sec. 2.4), but far less effort has been
expended on it. As a result, the best computer-set music (and by “computer-set” I
mean here fully automatically computer-set in the sense of Sec. 1.3.1) is still clearly
inferior to the best manually set music in most ways. However, this does not mean
that we should proceed by slavishly trying to imitate manual techniques in every
case.

An example of this mistake, it seems to me, is Gomberg's general attitude
towards music engraving, and specifically his algorithm for determining “ideal”
horizontal space requirements [GOMB75, pp. 42-69]. It is far more complex than it
needs to be because he attempts to use the same technique engravers use. On the
other hand, sometimes it really does pay to imitate manual techniques if this is done
properly, where the meaning of the word “properly” is not easy to specify. In
[KNUT79] Knuth discusses a similar question about the production of digital versions
of typefaces, which several people had approached as the purely technical question of
making good digitized copies (p. 17ff):[10]

I felt that the whole idea of making a copy [of an existing font in digital form] was
not penetrating to the heart of the problem. It reminded me of the anecdote I had
once heard about slide rules in Japan. According to this story, the first slide rule
brought to the Orient had a black speck of dirt on it; so for many years all Japanese
slide rules had a useless black spot in the same position! The story is probably
apocryphal, but the point is that we should copy the substance rather than the form. I
felt that the right question to ask would not be “How should this font of type be
copied?” but rather, “If the great type designers of the past were alive today, how
would they design fonts for the new equipment?”

Similarly, in [KNUT82] Knuth describes the way his METAFONT system draws
characters by simulating a pen whose nib's size and shape are variable. This
technique does not imitate modern design of characters for phototypesetting or, formerly,
printing with movable type; rather it imitates the technique used by the calligraphers
from whose work modern letterforms evolved.





Gomberg was certainly aware of this issue of how tasks should be automated, as
shown by his observation (p. 30) that

There are essentially two ways one can attempt to accomplish the activity of a
human being with the aid of a computer. One is to duplicate as nearly as possible
every step of the human activity . . . The second approach . . . is to attempt to use
the same input as the human being and, by whatever means possible and convenient,
produce the same output as the human being.

He then gives several reasons why a program cannot really duplicate the behavior of 
a human music engraver: 

[The engraver] has eyes with which he can quickly scan for important features and
ignore those that his experience tells him are trivial. He can easily adjust and
readjust his field of concentration, if necessary, from one to several pages. He uses
short-term memory, but he can refresh it with the important details literally “at a
glance”. Thus he is able to make gross level judgements about whether there are
special problems that need to be considered in some detail before proceeding further,
or whether the current pages . . . are quite ordinary.

Therefore, Gomberg says, “alternative means must be found.” 

5.5. PUNCTUATION AND CASTING OFF

In punctuation and casting off, music-setting systems have much to learn from
typesetting systems, in particular TeX's “glue”, “boxes”, and “penalties”. The
whole-movement-at-a-time method for casting off that Smith uses in MSS also looks
excellent, if only because of the peculiar tradition that a piece of music completely
fills its last page (Sec. 2.3.6.4). Unless one attempts automatic page-turn decisions
(an attempt that would be likely to fail: again see Sec. 2.3.6.4), the constraints
involved in the punctuation process are simpler than those in collision avoidance.
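
To make the “boxes” and “glue” idea concrete, here is a minimal Python sketch of how one system of music might be justified in that style. It follows the general shape of TeX's line-setting model (all glue is stretched or shrunk by one common ratio) but omits penalties and line breaking entirely. The names Box, Glue, and justify, and the numbers in the usage note, are assumptions made for illustration and are not taken from TeX, MSS, or SMUT.

```python
from dataclasses import dataclass


@dataclass
class Box:
    width: float     # fixed-width item, e.g. a notehead or a clef


@dataclass
class Glue:
    natural: float   # preferred width of the space
    stretch: float   # how much the space may grow
    shrink: float    # how much the space may shrink


def justify(items, target):
    """Return the left x-position of each Box after setting the glue.

    Every piece of glue is stretched or shrunk by the same ratio so that
    the items fill `target` whenever the glue has any flexibility.
    """
    natural = sum(i.width if isinstance(i, Box) else i.natural for i in items)
    glues = [i for i in items if isinstance(i, Glue)]
    excess = target - natural
    if excess >= 0:
        capacity = sum(g.stretch for g in glues)
    else:
        capacity = sum(g.shrink for g in glues)
    # With no flexibility at all, the items are left at their natural widths.
    ratio = excess / capacity if capacity else 0.0
    if ratio < -1.0:
        raise ValueError("line cannot shrink enough to fit")
    positions, x = [], 0.0
    for item in items:
        if isinstance(item, Box):
            positions.append(x)
            x += item.width
        else:
            flex = item.stretch if ratio >= 0 else item.shrink
            x += item.natural + ratio * flex
    return positions
```

For example, three boxes of width 2 separated by Glue(1, 1, 0.5) and Glue(1, 3, 0.5) have a natural width of 8; justified to a width of 12, the second space absorbs three times as much of the extra room as the first, which is how a longer note value could be given proportionally more space.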

5.6. RESEARCH PROBLEMS

Here is a brief list of some of the remaining research problems in computer music
setting, nearly all of which I have already discussed. As I have tried to make clear in
this chapter, the first two problems are of great difficulty and great importance, both
practical and theoretical, for many fields, not just music setting. Most of the others,
though very difficult to solve fully, are of relatively slight practical importance, since
they can be handled rather easily by interactively correcting rough algorithmic
solutions.

(1) Automatically placing symbols to avoid perceptual collisions and maximize
readability, especially in complex textures. This problem appears to be
“AI-complete” (see Chapter 2, note 47).

(2) Supporting multiple levels of control in a really convenient way. 

(3) Deciding automatically where to put page turns, in scores as well as parts.

(4) Deciding automatically what cue notes to use, if any, after long rests in one
performer's part.

(5) Deciding automatically on clef changes.

(6) Deciding automatically what groups of notes to beam together.

(7) Choosing cautionary accidentals automatically. 

The various input methods bring in a host of additional problems, which are
discussed in Chapter 3.

5.7. SOME CONCLUDING SPECULATION: PROSPECTS FOR COMPUTER-SET MUSIC

Using as an example Elliott Carter’s exceptionally complex Double Concerto for
Harpsichord and Piano with Two Chamber Orchestras, engraved in about 1962,
Gomberg speculates rather pessimistically about the future of computer music setting
[GOMB75, p. 890]:

The estimate that I have received from people in the industry is that probably no
page of the Carter was set for less than fifty dollars, and that [some] pages . . . might
well have cost in excess of one hundred dollars. It is probably reasonable to estimate
that these figures would be doubled today . . . I have tried to project very carefully
the cost of producing an “average” page of the Carter Double Concerto by means of
the automatic processes described in this paper . . . After first making this estimate,
which includes all labor and computer-related costs, I doubled it in order to allow for
error and personal prejudice, and arrived at a figure of about twenty-five dollars a
page . . . it is not possible to say that for music “half as difficult as the Carter” the
cost would be reduced by half, but it is certainly true that one can expect to
compete more than favorably with existing processes . . . Despite such a promising cost
projection, the major obstacle to the culmination of this project as a commercially
viable venture is lack of funding.

Gomberg estimates total development cost to complete his project (which, it will be
recalled, was intended to be as automatic as possible) at $500,000, and suggests that
the music publishing industry will be unable, and foundations and venture capitalists
will be unwilling, to supply these funds.

Prospects for the completion of similar work now are very much better. For one
thing, computer resources are, of course, much cheaper and more readily available
than they were in 1975. For another, several workers, including myself, have already
put a large amount of effort into computer setting of music, mostly without external
funding. In fact, to my knowledge there are now three systems in regular use for
music publishing (the Dataland Scan-Note system, Watkins’s Musigraph, and
Armando Dal Molin’s), and several companies that make digital synthesizers have
announced computer music-setting facilities of some sort (see Sec. 3.4 for details). (In
1975 there were no commercially available digital synthesizers, much less
music-setting facilities for them.) No existing system comes close to Gomberg’s goals of
putting every jot and tittle in place automatically, but there can hardly be any doubt
that systems will continue to improve, and (considering the high level of interest
now evident) probably at a rapid pace. As I have tried to show, there is a
fundamental and clear-cut tradeoff: if one insists on “Fully Automatic High-Quality Music
Notation”, one should expect to wait until most of the major problems of artificial
intelligence are solved (not likely within my lifetime); but if one is willing to accept
practical compromises, a decade should more than suffice.


NOTES 

[1] In fact a paper comparing no less than ten such algorithms appeared some years ago
[SUTH74]. Both of the standard texts on computer graphics, [FOLE82] and [NEWM79], include
extensive discussions of hidden-surface algorithms (as well as hidden-line algorithms, which are
closely related).

[2] Neither [FOLE82] nor [NEWM79] even mentions this problem. This is perhaps
understandable, since they are dealing with interactive computer graphics, and collisions can always be
avoided by interactive methods, without requiring any knowledge of collisions on the program's
part. However, I have not seen a general discussion of collision avoidance anywhere else.

[3] Or should changes be made that might, as far as a program knows, change the semantics of
the material? This is not at all unheard of in manual setting of CMN (or, for that matter, text,
where it implies changing the wording); in fact I know of one instance [HASS83] in which a
composer made a change that unquestionably affected semantics, specifically rhythm, in order to
avoid a collision. However, any such behavior obviously requires a great amount of intelligence,
so I will ignore the possibility.

[4] In particular, it should be doable with any z-buffer system, substituting symbol descriptors for
z-coordinates.

[5] In fact, for various reasons, music is occasionally printed with two or three colors, and for the 
same reasons plus feedback it might well be useful for a computer system to use colors. This 
could be accommodated simply by allowing two bits per pixel instead of one. See Chapter 3, note
18.

[6] As I pointed out in Sec. 2.3.0, the term “voice” is ambiguous. The meaning I intend here
allows “thickening”.

[7] SMUT actually does have a mechanism to do this, but only to a very limited extent. Its
SHARE command dynamically tells SMUT to behave as if the current voice were the upper voice 
of a shared staff, the lower voice of a shared staff, or on an unshared staff. See Sec. 4.3. 

[8] Meyrowitz and Van Dam make the statement that “graphics editors are described in
[NEWM79] and [FOLE82]”. This is, unfortunately, not true: [NEWM79] and [FOLE82] only
describe general graphics principles, which could of course be applied to build a graphics editor. I
know of no published survey of graphics editors.

[9] Programming languages provide many good examples of the importance of providing various
levels of control. One is the well-known weakness of PASCAL for systems programming because
it does not allow access to the low-level aspects of conventional machines. In my opinion, C has 
been more successful than PASCAL mostly because of its wider range of levels of control: it goes 
equally high in level and much lower. 

[10] The same question might be asked about present-day electronic musical instruments, several
of which go to great lengths to imitate the timbres of acoustic instruments. The analogy is an 
imperfect one, however: there have been type designers for centuries, but only in the last few 
years has technology approached the point where someone could possibly act as a "timbre 
designer" without spending most of their time on side issues involving (for example) the physics of 
cylindrical tubes.


“I think I should understand that better,” Alice said very
politely, “if I had it written down: but I can’t quite follow it as you
say it.” “That's nothing to what I could say if I chose,” the
Duchess replied, in a pleased tone.

Lewis Carroll, Alice in Wonderland, Chapter IX

The Red Queen shook her head. “You may call it ‘nonsense’ if
you like,” she said, “but I've heard nonsense, compared with
which that would be as sensible as a dictionary!”

Lewis Carroll, Through the Looking Glass, Chapter II



Postscript 


The year since this dissertation was written has seen extensive work by several persons 
on computer notation of music and related topics. All I can do here is make a few brief
comments and point to recent publications. 

Jim Miller of Electronic Screen Productions has developed a highly interactive system
that performs a number of functions in addition to CMN editing and output. His “Personal
Composer” runs on an IBM Personal Computer with any of several commercially available
digital synthesizers connected via MIDI (the Musical Instrument Digital Interface, a
recently-developed serial interface standard) for real-time clavier input and audio output.
The notation, produced on a dot-matrix printer, is nowhere near engraving quality but is
quite readable. “Personal Composer” is scheduled for distribution in the near future by
Yamaha Corporation. See [FREF83].

Passport Designs’ new “Polywriter” (see the company’s advertising literature [PASS84])
runs on an Apple II computer and produces output on a small dot-matrix printer. It
accepts input, apparently in real time, from Passport Designs’ own “Soundchaser” or via
MIDI. It can handle 16 staves per page, and apparently up to 12 notes on a single stem per
staff. Quality of the notation is poor, and Polywriter has some severe limitations, e.g., no
durations shorter than triplet 16ths and very little flexibility in page layout. On the other
hand, it can transpose, and it knows some of the notational properties of instruments,
specifically position in an orchestral score and transposition. In the words of Chris Albano
of Passport Designs [PASS84], “Polywriter is the most significant development since the
music typewriter.”

Syntauri Corporation is marketing still another program (called “Composer’s Assistant”)
that takes real-time clavier input and prints it in CMN on a dot-matrix printer. Syntauri
makes an inexpensive synthesizer that is driven by an Apple II computer, the computer on
which Composer’s Assistant runs. However, a recent review of the “alphaSyntauri” music
system [AIKI83] says, “In its current form . . . this program still has some fairly severe
limitations . . . What [it] needs is some editing software that would allow you to interact
with your score while it’s still on the screen . . .”

I only recently became aware of a system in use at the University of Illinois’
Computer-based Education Research Laboratory (CERL). Carla Scaletti wrote of it [SCAL84]: “The
CERL Music Group currently supports a music print program, written by Lippold and
Dorothea Haken, which is heavily used by both musicologists and composers at the
University of Illinois, and which allows for graphic or alphanumeric editing and the audition of
entered scores.”

The Xerox PARC Mockingbird system has now been described in an article in a
widely available publication [MAXW84].

Turning to a more theoretical area, in Sec. 5.2 I mentioned constraint systems as a 
promising approach to the symbol placement problem. A recent paper by David Levitt 
[LEVI84] discusses applications of such systems to music, although not specifically to music
notation.


SMUT USER'S GUIDE
INTRODUCTION 


THE FOLLOWING MATERIAL IS EXTRACTED FROM THE SMUT MAIN PROGRAM. 


      PROGRAM SMUT( INPUT=200, OUTPUT=200, PLOT=200, TAPE9=200,
     $ ZSMUT1=520, ZSMUT2=520, ZSMUT3=520, ZSMUTX=520, TAPE5=INPUT,
     $ TAPE6=OUTPUT, TAPE8=PLOT, TAPE10=ZSMUT1, TAPE11=ZSMUT2,
     $ TAPE12=ZSMUT3, TAPE13=ZSMUTX )
C
C                        TO IANNIS XENAKIS
C
C                              SMUT
C
C                        SYSTEM FOR MUSIC
C                          TRANSCRIPTION
C
C                           DONALD BYRD
C                        INDIANA UNIVERSITY
C
C        BEGUN SEPTEMBER 1968.  SMUT VERSION 1.2, JULY 1975.
C        POLYPHONIC VERSION 2.0, SEPTEMBER 1977.
C        VERSION 2.8 TO SUPPORT SHARED STAVES, MARCH 1982.
C        VERSION 2.9, JANUARY 1983.
C
C        COPYRIGHT (C) 1972,1975,1976,1977,1978,1979,
C        1980, 1981, 1982, DONALD BYRD
C
C     COMMENTS, QUESTIONS, ETC. ARE WELCOME AND SHOULD BE ADDRESSED
C     TO THE AUTHOR AT
C
C            KURZWEIL MUSIC SYSTEMS
C            411 WAVERLEY OAKS ROAD
C            WALTHAM, MASSACHUSETTS 02154
C     OR CALL 617-893-5900.
C
C     THIS PROGRAM TRANSLATES ALPHANUMERIC DATA INTO STANDARD MUSICAL
C     NOTATION AND OUTPUTS IT TO A DIGITAL PLOTTER OR SIMILAR DE-
C     VICE.  SMUT 2.9 CAN DRAW SCORES WITH ONE OR TWO VOICES PER
C     STAFF, OR PARTS WITH ONE VOICE.  IN SCORES, ALL BARLINES MUST
C     COINCIDE.
C
C     A DESCRIPTION OF THE DATA FORMATS MAY BE FOUND IN THE LISTINGS
C     OF SUBROUTINES INITSM AND PASS1.  NOTE, HOWEVER, THAT THE
C     DATA FORMAT IS RATHER CLUMSY TO PREPARE MANUALLY AND THERE
C     WILL RARELY BE ANY NEED TO DO SO, SINCE PROGRAMS EXIST (SMIRK
C     AND JANUS) THAT TRANSLATE MUSTRAN DATA AND MUSIC V NOTE CARDS
C     INTO SMUT COMMANDS.
C
C     FOR FILE USAGE AND INSTALLATION PARAMETERS, SEE THE BLOCK DATA
C     MODULE.
C
C     IMPORTANT GENERAL COMMENTS ON THE STRUCTURE OF THE PROGRAM ARE
C     IN THIS ROUTINE, WRITE1, PASS2, PASS3, AND JUSTIF.
C
C     SMUT AND RELATED PROGRAMS (INCLUDING MUSTRAN, MUSIC V, SMIRK,
C     AND JANUS) ARE DISCUSSED IN DONALD BYRD, 'AN INTEGRATED COM-
C     PUTER MUSIC SOFTWARE SYSTEM', COMPUTER MUSIC JOURNAL 1.2 (JULY
C     1977).  A STANDARD MUSIC NOTATION TEXT WHICH I HAVE OFTEN IG-
C     NORED HEREIN IS GARDNER READ, MUSIC NOTATION, 2ND EDITION
C     (BOSTON--CRESCENDO, 1969).  SEE ALSO TED ROSS, THE ART OF
C     MUSIC ENGRAVING AND PROCESSING (MIAMI--HANSEN, 1970).
C
C     'THIS PROGRAM I HAVE LEARNED FROM MY USERS.'  I AM GRATEFUL TO
C     ALL MY COLLEAGUES AT THE WRUBEL COMPUTING CENTER, INDIANA
C     UNIVERSITY, AS WELL AS TO THE MUSICIANS WHO HAVE STRUGGLED
C     WITH SMUT.  I ESPECIALLY WISH TO THANK JEFF HASS, JIM HETT-
C     MER, DOUG HOFSTADTER, DAVE KRIEWALL, ROSALEE NERHEIM, BRUCE
C     ROGERS, DAVID ROTH, AND GARY WITTLICH FOR MANY HELPFUL COM-
C     MENTS AND FOR GENERAL MORAL SUPPORT.
C
C     THE FOLLOWING COMMENTS ARE FOR THE BENEFIT OF THOSE FAMILIAR
C     WITH EARLIER VERSIONS OF SMUT.
C
C     DIFFERENCES BETWEEN SMUT 2.9 AND SMUT 2.0 (YOU DECIDE WHETHER
C     THEY'RE INCOMPATIBILITIES OR NOT)...
C
C     1. THE R COMMAND NOW HAS AN OPTION FOR DESCRIBING MULTI-BAR
C        RESTS, AND ITS Y-POSITION PARAMETER IS NOW IN HALF-SPACES
C        (NOT SPACES).
C     2. A NEW OPTION, KIND=+2, HAS BEEN ADDED TO THE 'S' COMMAND.
C     3. A Y-POSITION PARAMETER LIKE THAT FOR THE 'A', 'P', AND 'R'
C        COMMANDS HAS BEEN ADDED TO THE 'T' COMMAND.
C     4. SOME OPTIONS HAVE BEEN ADDED TO THE 'B' COMMAND, AND THE
C        MEANING OF THE PARAMETER'S SIGN (WITH AUTOBARRING) RE-
C        VERSED.
C     5. 12 HEADER/FOOTER LINES ARE ALLOWED INSTEAD OF 5.
C     6. ADDITIONAL LAYOUT LINE PARAMETERS INDICATE WHAT QUALITY
C        IS NEEDED AND CHARACTERISTICS OF THE PLOTTER BEING USED.
C     7. SOME NOTE-RELATED SYMBOLS ARE NOW ALWAYS PLACED ABOVE THE
C        STAFF (E.G. UPBOW, DOWNBOW) WHILE OTHERS GO ABOVE OR BE-
C        LOW THE NOTEHEAD (E.G. ARTICULATION MARKS).
C     8. THE FIRST COMPLETE BAR IS NOW LABELLED BAR 1, RATHER THAN
C        A PARTIAL BAR (WHEN THERE IS AN ANACRUSIS).
C     9. IF A SYSTEM IS NOT RIGHT-JUSTIFIED, ONLY THE PORTION OF
C        THE STAVES ACTUALLY USED IS DRAWN.
C 10. THREE NEW CONTROL COMMANDS HAVE BEEN ADDED - '* HCROWD'
C     TO CONTROL HORIZONTAL SPACING, AND '* MEASNO'/'* NOMEASNO'
C     TO DETERMINE WHETHER MEASURE NUMBERS SHOULD BE WRITTEN ON
C     THE MUSIC OR NOT.
C 11. A PARAMETER HAS BEEN ADDED TO THE 'S' COMMAND THAT MAKES
C     SOME TIME SIGNATURES NOT CONSISTING OF NUMBER-ABOVE-NUMBER
C     (E.G. 'C', THE SINGLE NUMBER '3') DOABLE.
C 12. RHYTHM DECOMPOSITION HAS BEEN GENERALIZED TO HANDLE ANY
C     METER WITH NO MORE THAN 10 BEATS PER MEASURE.
C 13. CONTROL COMMANDS '* SHARP' AND '* RHLIN' HAVE BEEN CHANGED
C     TO '* NOFLAT' AND '* NORHDEC', RESPECTIVELY.
C 14. THE DEFAULTS FOR LAYOUT LINE PARAMETERS HS, YDELT HAVE
C     BEEN CHANGED.
C 15. THREE NOTE-ASSOCIATED SYMBOLS ('AH', 'TW', 'MR') HAVE BEEN
C     ADDED.
C 16. AN INITIALIZATION LINE CAN NOW SPECIFY HOW VOICES ARE
C     GROUPED, AND THERE IS AN OPTION ON P COMMANDS TO MAKE
C     THEIR EXECUTION CONDITIONAL ON VOICE GROUPING. VOICE
C     GROUPING INCLUDES HAVING TWO VOICES SHARE A STAFF.
C 17. THE LAYOUT LINE MUST HAVE AN 'L' IN COLUMN 1.
C 18. RESTS CAN NOW HAVE FERMATI AND PAUSES AS ASSOCIATED
C     SYMBOLS.
C 19. A VERSION OF READ4, THE PASS4 INPUT SUBROUTINE, CONTAINING
C     A BUILT-IN EDITOR IS NOW AVAILABLE. THIS MAKES IT
C     POSSIBLE TO DO A GREAT DEAL THAT COULD NOT BE DONE
C     OTHERWISE.
C 20. THE LEGAL PITCH RANGE HAS BEEN REDUCED TO A MORE
C     REASONABLE ONE.
C 21. THE P COMMAND NOW ALLOWS A CHOICE OF FONTS (E.G., FOR
C     DYNAMICS).
C 22. A RUNNING FOOTER COMMAND NOW ALLOWS SPECIFYING A LINE TO
C     BE WRITTEN ON EVERY PAGE OF MUSIC (AFTER THE FIRST).
C 23. PERFORMANCE DIRECTIONS CAN GO UP TO SIZE 5 INSTEAD OF 4.
C 24. CONTROL OF BEAM EXTENT IS NOW POSSIBLE WITH '* BEAM'.
C 25. THERE ARE NEW CONTROL COMMANDS '* SET', '* IFEQ', AND
C     '* ENDIF' FOR CONDITIONAL EXECUTION.
C 26. CONTROL OF SLUR CURVATURE AND DIRECTION IS NOW POSSIBLE.
C 27. SQUARE, DIAMOND, 'X' SHAPED AND OMITTED NOTEHEADS ARE NOW
C     AVAILABLE.
C 28. AN 'M' (MISCELLANEOUS SYMBOL) COMMAND CAN SPECIFY CERTAIN
C     SYMBOLS INDEPENDENT OF A NOTE OR REST, OR LEAVE HORIZONTAL
C     SPACE.
C 29. AN 'E' COMMAND CAN BE USED TO GIVE COMMANDS TO THE EDITING
C     VERSION OF READ4 DIRECTLY FROM THE REGULAR SMUT INPUT
C     FILE.
C 30. A NEW INITIALIZATION LINE CAN SPECIFY HOW MANY BARS TO PUT
C     IN EACH SYSTEM OF MUSIC.
C 31. THE 'A' (ARTIFICIAL GROUP) COMMAND HAS A NEW OPTION TO
C     SUPPRESS PRINTING OF ITS ACCESSORY NUMERAL.
C 32. LAYOUT LINE OPTIONS HAVE BEEN ADDED TO RUN ONLY PART OF
C     THE PROGRAM AT A TIME. THIS IS IN SUPPORT OF EXTERNAL
C     EDITING.
C 33. THERE ARE NEW CONTROL COMMANDS TO SET TIME USED OR
C     REMAINING IN A MEASURE FOR USE WITH ANACRUSES, NONSTANDARD
C     NOTATION, ETC. THIS FUNCTION FORMERLY WAS PERFORMED BY THE
C     TNOW PARAMETER ON THE LAYOUT LINE, WHICH IS NOW GONE.
C 34. INTEGERS ARE NOW READ WITH TRAILING BLANKS IGNORED INSTEAD
C     OF BEING TREATED AS ZEROS.

C 35. THERE ARE NEW CONTROL COMMANDS TO TURN JUSTIFICATION ON
C     AND OFF AND TO GIVE BETTER CONTROL OF HORIZONTAL SPACING.
C 36. THE REST COMMAND HAS AN 'INVISIBILITY' OPTION.
C 37. THE 'S' COMMAND NOW HAS A PARAMETER TO CONTROL VERTICAL
C     SPACING AND A 'START NEW PAGE' OPTION.
C 38. SEVERAL PARAMETERS HAVE BEEN ADDED TO THE 'I' COMMAND SO
C     IT NOW CONTROLS STAFF INDENTING AS WELL AS IDENTIFICATION,
C     AND CONTROL OF THE SIZE AND FONT OF THE IDENT. IS NOW
C     POSSIBLE.
C 39. A NEW '* REPKS' CONTROL COMMAND AFFECTS REPETITION OF THE
C     OLD KEY SIGNATURE AT THE BEGINNING OF THE STAFF WHEN THE
C     KEY SIGNATURE CHANGES IN THE FIRST MEASURE OF THE STAFF.
C 40. THE STAFF AND NOSTAFF CONTROL COMMANDS HAVE BEEN RENAMED
C     SYST AND NOSYST.
C 41. A '* SHARE' CONTROL COMMAND HAS BEEN ADDED FOR SPECIAL
C     EFFECTS.
C 42. EXPLICIT BEGINNING AND END POINTS FOR BEAMS CAN BE
C     SPECIFIED.
C 43. THERE IS NOW A PARAMETER TO SET THE BOTTOM MARGIN ON THE
C     LAYOUT LINE.
C
C MANY MINOR IMPROVEMENTS HAVE ALSO BEEN MADE IN THE LISTING,
C ERROR CHECKING, GRAPHIC QUALITY OF THE NOTATION, ETC.

C

LAYOUT AND INITIALIZATION LINES

DATA FOR A SMUT RUN CONSISTS OF EXACTLY ONE LAYOUT LINE, A
SMALL NUMBER OF INITIALIZATION LINES, AND ANY NUMBER OF MUSIC
DATA LINES, IN THAT ORDER.


A. THE LAYOUT LINE DESCRIBES THE PAGE FORMAT.
   (ALL DISTANCES, LENGTHS, ETC. ARE GIVEN IN INCHES)

                                                             NORMAL
COL.   FORMAT  NAME    INFORMATION                           DEFAULT

1      A1      IEORL   'L'=NORMAL, 'E'=EDIT MODE (SEE BELOW)

2      A1      IFSCOR  IF EDIT MODE, 'F'=RUN PASS IV ONLY    S
                       (REQUIRES AUXILIARY FILES). IN BOTH
                       MODES, 'P'=DO A SET OF PARTS, 'S'=DO
                       A SCORE. 'P' MAY ALSO BE USED FOR A
                       GROUP OF UNRELATED 1-VOICE EXAMPLES.

3-6    I4      LISTC   LISTING CONTROL (NEGATIVE=SHORT       (NONE)
                       LISTING... 0=NORMAL, PRINTS BACK
                       INPUT... POSITIVE=NORMAL PLUS DEBUG
                       PRINT).

7-11   F5.2    RESOL   RESOLUTION OF THE PLOTTER USED        .005
                       (SEE BELOW)

12-16  F5.2    HS      HEIGHT OF EACH STAFF                  .24

17-21  F5.2    XLN     LENGTH OF THE STAFF                   7.70

22-26  F5.2    YH      Y-COORDINATE OF THE BOTTOM LINE OF    9.60
                       THE TOP STAFF ON EACH PAGE AFTER 1ST

27-31  F5.2    YDELT   VERTICAL DISTANCE BETWEEN STAVES,     .96
                       BOTTOM TO BOTTOM

32-36  F5.2    YMIN    THE LOWEST POSSIBLE POSITION FOR THE  .4
                       BOTTOM OF A STAFF. VERY ROUGHLY
                       SPEAKING, THIS IS THE BOTTOM MARGIN.

37-41  I5      MAXERR  MAXIMUM NO. OF NONFATAL ERRORS        25

42     A1      MODPUB  BLANK=NORMAL, 'P'=PUBLICATION         (NONE)
                       QUALITY DESIRED. SEE BELOW.

43-44  I2      LINWID  USED WITH RESOL TO DETERMINE          10
                       LINEWIDTH. SEE BELOW.

INTERNALLY, SMUT CONSISTS OF FOUR 'PASSES', THE LAST OF WHICH
ACTUALLY DRAWS THE MUSIC (SEE 'PROGRAM OPERATION' BELOW FOR MORE
DETAILS). IN NORMAL MODE, ALL FOUR PASSES ARE RUN. IN EDIT MODE,
IF IFSCOR='S' OR 'P', ONLY THE FIRST THREE ARE RUN AND TWO
INTERMEDIATE FILES DESCRIBING THE SCORE OR SET OF PARTS ARE
GENERATED.

IN EDIT MODE, IF IFSCOR='F', ONLY PASS IV IS RUN, AND THE TWO
INTERMEDIATE FILES (PRESUMABLY GENERATED BY A PREVIOUS RUN) MUST
BE AVAILABLE. AGAIN, SEE 'PROGRAM OPERATION' FOR MORE DETAILS.

THE TABLE ABOVE HAS THE HEADING 'NORMAL' DEFAULT BECAUSE, WHEN
PASS IV ONLY IS RUN, DEFAULTS ARE TAKEN FROM THE LAYOUT LINE USED
WHEN PASSES I THRU III WERE RUN. IN ANY CASE, THE DEFAULT VALUE,
WHERE THERE IS ONE, WILL BE SUBSTITUTED FOR ANY PARAMETER THAT IS
BLANK. IN ADDITION, VALUES ARE RESTRICTED AS INDICATED IN THE
DESCRIPTION OF ERROR MESSAGE F02.
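The fixed-column layout line and its blank-means-default rule can be illustrated with a short Python sketch. This is not SMUT's own reader (which is FORTRAN); the names `LAYOUT_FIELDS` and `parse_layout_line` are hypothetical, the column ranges and defaults are copied from the table above, and real-valued fields are assumed to carry an explicit decimal point. Range checking (error message F02) is not modelled.

```python
# Column layout and normal defaults copied from the layout-line table.
# Each entry: (name, first column, last column, kind, default).
LAYOUT_FIELDS = [
    ("IEORL", 1, 1, "A", None),
    ("IFSCOR", 2, 2, "A", "S"),
    ("LISTC", 3, 6, "I", None),
    ("RESOL", 7, 11, "F", 0.005),
    ("HS", 12, 16, "F", 0.24),
    ("XLN", 17, 21, "F", 7.70),
    ("YH", 22, 26, "F", 9.60),
    ("YDELT", 27, 31, "F", 0.96),
    ("YMIN", 32, 36, "F", 0.4),
    ("MAXERR", 37, 41, "I", 25),
    ("MODPUB", 42, 42, "A", None),
    ("LINWID", 43, 44, "I", 10),
]


def parse_layout_line(line):
    """Split a layout line into a dict of parameter values."""
    padded = line.ljust(80)
    values = {}
    for name, first, last, kind, default in LAYOUT_FIELDS:
        field = padded[first - 1:last].strip()
        if not field:
            values[name] = default      # blank field: substitute the default
        elif kind == "A":
            values[name] = field        # character field
        elif kind == "I":
            values[name] = int(field)   # integer field
        else:
            values[name] = float(field) # real field, explicit decimal point assumed
    return values


# A layout line for a wide pen plotter, built column by column.
example = "LS  -1 .002  .27 10.2 12.8  .94" + " " * 10 + "P15"
params = parse_layout_line(example)
```

Here `params["YMIN"]` and `params["MAXERR"]` come back as the defaults because those columns are blank in the example line.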



RESOL AND LINWID TELL SMUT ABOUT THE PLOTTER TO BE USED SO
IT CAN PRODUCE THE BEST QUALITY PLOT WITHOUT WASTING COMPUTER OR
PLOTTER TIME. SPECIFICALLY, RESOL IS THE RESOLUTION OF THE
PLOTTER, E.G. .005 FOR VERSATEC 1200, .002 FOR CALCOMP 1037, ETC.

ORDINARILY, THE WIDTH OF A LINE ON A PLOTTER IS THE SAME AS ITS
RESOLUTION, BUT ON A MECHANICAL (PEN-AND-INK) PLOTTER SUCH AS THE
CALCOMP 1037, MOST PENS ARE WIDER THAN THIS. SMUT USES THE
LINEWIDTH AS THE SPACING BETWEEN THE PARALLEL LINES IT DRAWS TO
BLACKEN MANY SYMBOLS IN, SO LINWID CAN BE USED TO SPECIFY A
LINEWIDTH LARGER THAN THE RESOLUTION VIA THE FORMULA
      LINEWIDTH = RESOL*.1*ABS(LINWID).

FINALLY, IF MODPUB IS 'P', SMUT TAKES EXTRA CARE IN DRAWING SOLID
NOTEHEADS AND IN THICKENING EVERYTHING (ESPECIALLY SLURS, CLEFS,
AND QUARTER RESTS) IN AN EFFORT TO PRODUCE THE BEST POSSIBLE PLOT
QUALITY. THIS SLOWS THE PROGRAM DOWN AND IS LIKELY TO HELP ONLY
ON HIGH-RESOLUTION VECTOR-DRAWING PLOTTERS, ESPECIALLY PEN-AND-INK
PLOTTERS.

SMUT INSISTS ON HS BEING AT LEAST 15 TIMES AS LARGE AS
RESOL. ACTUALLY, FOR READABLE RESULTS, IT SHOULD BE AT LEAST 20
OR, BETTER, 30 TIMES AS LARGE.
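As a worked illustration of the linewidth formula and the HS/RESOL rule just stated, here is a small Python sketch. The function names and the four rating labels are hypothetical, chosen only for this example; the thresholds 15, 20 and 30 are the ones given in the text.

```python
def linewidth(resol, linwid):
    """Linewidth in inches: RESOL * .1 * ABS(LINWID)."""
    return resol * 0.1 * abs(linwid)


def staff_height_rating(hs, resol):
    """Classify a staff height by how many plotter steps it spans."""
    ratio = hs / resol
    if ratio < 15:
        return "rejected"            # SMUT insists on at least 15
    if ratio < 20:
        return "accepted, hard to read"
    if ratio < 30:
        return "readable"
    return "good"


# With the defaults (RESOL=.005, LINWID=10) the linewidth equals the
# resolution; a pen plotter with LINWID=15 gets a line 1.5 times as wide.
default_width = linewidth(0.005, 10)
pen_width = linewidth(0.002, 15)
default_rating = staff_height_rating(0.24, 0.005)
```

With the default LINWID of 10 the formula reduces to LINEWIDTH = RESOL, which matches the statement that line width ordinarily equals the resolution.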

SMUT RELIES ON YMIN TO PREVENT THE PLOT FROM GOING OFF THE
BOTTOM OF THE PAGE. THIS IS BY NO MEANS FOOLPROOF, ESPECIALLY IF
THE MUSIC HAS VERY LOW NOTES OR PERFORMANCE DIRECTIONS WELL BELOW
THE STAFF. EXPERIMENTATION WITH YMIN MAY BE NECESSARY FOR BEST
RESULTS.

SPACING BETWEEN SYSTEMS IN A SCORE IS AFFECTED BOTH BY YDELT
AND BY THE NUMBER OF STAVES. HOWEVER, IT WILL ALWAYS BE
SOMEWHERE BETWEEN (NO. OF STAVES)*YDELT AND (NO. OF STAVES+1)*YDELT
- I.E., UP TO YDELT MORE SPACE IS LEFT BETWEEN SYSTEMS THAN
BETWEEN STAVES WITHIN A SYSTEM. SPECIFICS APPEAR IN THE FOLLOWING
TABLE.

      NO. OF STAVES     SPACE BETWEEN SYSTEMS
      1                 1*YDELT
      2                 2.25*YDELT
      3                 3.5*YDELT
      4                 4.75*YDELT
      5 OR MORE         (NO. OF STAVES+1)*YDELT

IT IS POSSIBLE, ALTHOUGH SOMEWHAT MESSY, TO COMPUTE FROM THIS HOW
MANY SYSTEMS SMUT WILL PUT ON A PAGE, GIVEN THE NUMBER OF STAVES
IN THE SYSTEM, YDELT, AND YH.
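The system-spacing table can be written as a small lookup. The Python sketch below is only an illustration of that table; the function name `system_spacing` is hypothetical, and it does not attempt the page-filling computation, whose details are not given here.

```python
def system_spacing(n_staves, ydelt):
    """Space between systems, in inches, per the spacing table."""
    if n_staves < 1:
        raise ValueError("a system needs at least one staff")
    # Multipliers for 1 to 4 staves; 5 or more use (staves + 1).
    factors = {1: 1.0, 2: 2.25, 3: 3.5, 4: 4.75}
    factor = factors.get(n_staves, n_staves + 1)
    return factor * ydelt


# A two-staff system with the default YDELT of .96 inch.
piano_spacing = system_spacing(2, 0.96)
```

For every staff count the result lies between (staves)*YDELT and (staves+1)*YDELT, as the text requires.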


FOLLOWING ARE SEVERAL SAMPLE LAYOUT LINES, WITH NAMES OF OUTPUT
DEVICES THEY MIGHT BE SUITABLE FOR.

L.LISTRESOLHS...XLN..YH...YDELTYMIN.MAXERMLW  OUTPUT DEVICE

LS   0 .005  .24 7.70 9.60  .96  .40   20 00  (DEFAULTS)
LP   0 .010  .60 6.60  4.8  1.5        -1 00  DEI+ADM3A/TEK, BIG
LS   0 .010  .40 6.60  4.0  1.2           00  DEI+ADM3A/TEK, MED.
LS   0  .05  1.8 32.0  8.0  4.0        -1 00  LINE PRINTER, BIG
LS   0  .05  1.0 31.5  9.2  2.7        -1 00  LINE PRINTER, MED.
LS  -1 .002                              P15  CALCOMP 1037, NARROW
LS  -1 .002  .27 10.2 12.8  .94          P15  CALCOMP 1037, WIDE
LS  -1 .002  .31 11.5 14.4 1.05          P15  CALCOMP 1037, 67PCT.
LS   0 .005                               00  VERSATEC 1200
LS   0 .005  .20            .80           00  VERSATEC 1200, SMALL


B. INITIALIZATION LINES ARE EITHER 'H' (HEADER), 'F' (FOOTER), 'R'
(RUNNING FOOTER), 'G' (GROUP VOICES), 'B' (BARS PER SYSTEM), 'C'
(COMMENT), OR 'X' (END INITIALIZATION LINES AND BEGIN MUSIC DATA).


1. HEADER/FOOTER LINES TELL SMUT WHAT, IF ANYTHING, TO WRITE
AT THE TOP OF THE FIRST PAGE OF MUSIC (TYPICALLY THE TITLE OF
THE PIECE AND THE COMPOSER'S NAME) AND AT THE BOTTOM OF THE
FIRST PAGE (TYPICALLY A COPYRIGHT NOTICE AND/OR GENERAL
PERFORMANCE INSTRUCTIONS). THE FORMAT OF EACH LINE IS -

COL.   INFORMATION

1      'H' (HEADER LINE, INFORMATION TO GO AT TOP OF PAGE) OR
       'F' (FOOTER LINE, TO GO AT BOTTOM OF PAGE).

2      A DIGIT FROM 1 THRU 9 GIVING THE SIZE OF THE CHARACTERS
       TO BE USED FOR THIS LINE, RELATIVE TO THE HEIGHT OF THE
       STAFF. 1 TO 4 ARE THE SAME AS PERFORMANCE DIRECTION
       SIZES 1 TO 4. 8 MEANS CHARACTERS THE SAME HEIGHT AS THE
       STAFF, 9 IS EVEN LARGER. RECOMMENDED SIZES ARE...

                                IN SCORE    IN PARTS
       TITLE                    7 OR 8      6
       COMPOSER'S NAME, ETC.    5 OR 6      4

3-80   ON 'H' AND 'F' LINES, SPECIFICATION OF FROM ZERO TO
       THREE SUBLINES, EACH PRECEDED BY ONE OR MORE BLANKS.
       THE NOTATION OF A SUBLINE IS BEST DESCRIBED WITH AN
       EXAMPLE -

            R 'BENJAMIN BRITTEN'

       MEANS, WRITE 'BENJAMIN BRITTEN' RIGHT-JUSTIFIED ON THE
       LINE. BESIDES 'R', POSITION CODES ARE 'L' (LEFT-JUSTIFY)
       AND 'C' (CENTER). THUS, A COMPLETE LINE MIGHT BE -

       H7 L 'WILFRED OWEN' R 'BENJAMIN BRITTEN'

       I.E., WRITE AT THE TOP OF THE PAGE IN RATHER LARGE
       LETTERS A LINE WITH 'WILFRED OWEN' FLUSH AGAINST THE LEFT
       MARGIN AND 'BENJAMIN BRITTEN' FLUSH AGAINST THE RIGHT.
       ANY CHARACTER NOT APPEARING IN THE SUBLINE CAN BE USED
       TO DELIMIT ITS BEGINNING AND END, SO IT IS POSSIBLE TO
       HAVE TITLES THAT INCLUDE THE ' - FOR EXAMPLE,

            C/MILES' MODE/




AN WHERE A POSITION CODE IS EXPECTED TELLS SMUT TO 
IGNORE THE REST OF THE LINE. THIS ALLOWS (E. G. ) PUTTING 
SEQUENCE NUMBERS ON HEADER /FOOTER LINES. 

NO MORE THAN 12 HEADER AND FOOTER LINES MAY APPEAR. 


2. THE 'R' (RUNNING FOOTER) LINE GIVES INFORMATION TO BE WRITTEN 
AT THE BOTTOM OF EVERY PAGE AFTER THE FIRST. ITS FORMAT IS 
IDENTICAL TO THAT OF A HEADER/FOOTER LINE, EXCEPT THAT A RIGHT- 
JUSTIFIED SUBLINE SHOULD NOT BE USED, SINCE THE PAGE NUMBER IS 
WRITTEN AT THE RIGHT END OF THE RUNNING FOOTER LINE. SIZE 4 IS 
RECOMMENDED. ONLY ONE RUNNING FOOTER LINE SHOULD APPEAR. 


3. A 'G' LINE TELLS SMUT HOW TO GROUP VOICES IN A SCORE,
INCLUDING WHICH VOICES SHARE STAVES. IT CONTAINS A 'G' IN COLUMN
1, AND COLUMNS 2 THRU 80 CONTAIN GROUPING INFORMATION. ONE OR
MORE BLANKS INDICATE THE END OF A GROUP. THE SPECIFICATION OF
EACH GROUP IS EITHER A SERIES OF '1'S AND '2'S, WHERE '1' MEANS
UNSHARED AND '2' MEANS SHARED STAFF, OR A DIGIT 3 THRU 9
GIVING THE NUMBER OF UNSHARED STAVES. FOR EXAMPLE

      G 4 3

INDICATES FOUR VOICES IN THE FIRST GROUP AND THREE IN THE SECOND,
EACH WITH ITS OWN STAFF.

      G 22

INDICATES A SINGLE GROUP OF FOUR VOICES ON TWO SHARED STAVES,
E.G. FOR A CHORALE. FINALLY,

      G 1222 22 1 5

MEANS FOUR GROUPS, RESPECTIVELY WITH SEVEN, FOUR, ONE, AND FIVE
VOICES. VOICE 1 HAS ITS OWN STAFF, VOICES 2 AND 3 SHARE A STAFF,
VOICES 4 AND 5 SHARE ANOTHER, ETC. - THIS MIGHT BE A MOZART
ORCHESTRA. THE SAME THING COULD BE WRITTEN

      G 1222 22 1 11111

ANY GROUPING SPECIFIED FOR MORE VOICES THAN THERE IS DATA FOR IS
IGNORED.
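The 'G' line rules can be made concrete with a short Python sketch. It is an illustration only, not SMUT's parser: the function name `parse_g_line` and the list-of-lists result are hypothetical, and any group specification outside the two forms described above is simply rejected.

```python
def parse_g_line(line):
    """Return one list per group giving the number of voices on each staff."""
    if not line.startswith("G"):
        raise ValueError("not a 'G' line")
    groups = []
    for spec in line[1:80].split():          # blanks separate the groups
        if set(spec) <= {"1", "2"}:
            # '1' = unshared staff (one voice), '2' = shared staff (two voices)
            staves = [int(ch) for ch in spec]
        elif len(spec) == 1 and spec in "3456789":
            # a single digit 3 thru 9 = that many unshared staves
            staves = [1] * int(spec)
        else:
            raise ValueError("unrecognized group specification: " + spec)
        groups.append(staves)
    return groups


orchestra = parse_g_line("G 1222 22 1 5")
voices_per_group = [sum(staves) for staves in orchestra]
```

For the orchestra example, `voices_per_group` is the seven, four, one and five voices described in the text, and the long form `G 1222 22 1 11111` parses to the same structure.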

VOICE GROUPING AFFECTS HOW BARLINES ARE DRAWN - THEY ARE
INTERRUPTED BETWEEN GROUPS - AND HANDLING OF CONDITIONAL
PERFORMANCE DIRECTIONS, WHICH ARE WRITTEN ONLY FOR THE TOP OR
ONLY VOICE OF EACH GROUP.

VOICES SHARING A STAFF HAVE THEIR STEMS POINTED OUTWARDS, SLURS
AND GROUPET ACCESSORY NUMERALS PUT ON THE OUTSIDE, ETC. THEIR
CLEFS, KEY SIGNATURES, AND TIME SIGNATURES MUST ALWAYS AGREE.

N.B. SMUT 2.9 IS TOTALLY IGNORANT OF THE DANGER OF VOICES SHARING
A STAFF COLLIDING. IN PARTICULAR, WHEN THEIR NOTES ARE A SECOND
APART OR IN UNISON, THE HEADS WILL OVERLAP OR BE SUPERIMPOSED,
AND WHEN BOTH HAVE ACCIDENTALS AND ARE WITHIN A FOURTH, THE
ACCIDENTALS WILL OVERLAP.

DEFAULT WHEN DOING A SCORE IS THAT ALL VOICES FORM ONE GROUP AND
EACH HAS ITS OWN STAFF. 'G' LINES ARE ILLEGAL WHEN DOING PARTS.


4. 'B' (BAR-PER-SYSTEM) LINES CONTAIN IN COLUMNS 2 THRU 80 A
SERIES OF UP TO 60 DIGITS SPECIFYING HOW MANY BARS ARE TO BE
PUT IN EACH SYSTEM OF MUSIC. BLANKS ARE IGNORED. AN '=' MEANS
THE END OF THE LIST. OCCURRENCE OF A 'B' LINE PERMANENTLY
TURNS OFF SMUT'S NORMAL MECHANISM FOR DECIDING HOW MANY BARS TO
PUT IN EACH SYSTEM (ALTHOUGH IT WILL STILL TRY TO PREVENT
DISASTROUS OVERCROWDING). FOR EXAMPLE,

      B 2222333     -OR-     B 2222 333

SPECIFIES 2 BARS IN EACH OF THE FIRST 4 SYSTEMS AND 3 BARS IN
EACH OF THE NEXT 3. IF MORE SYSTEMS ARE REQUIRED, I.E. IF THERE
ARE MORE THAN 17 BARS, THE REST OF THE MUSIC WILL BE SQUEEZED
UNMERCIFULLY.
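A 'B' line is easy to read mechanically. The Python sketch below follows the rules just given (digits in columns 2 thru 80, blanks ignored, '=' ends the list); the function name `parse_b_line` is hypothetical and the sketch does not model what SMUT does once the list runs out.

```python
def parse_b_line(line):
    """Return the list of bars-per-system counts from a 'B' line."""
    if not line.startswith("B"):
        raise ValueError("not a 'B' line")
    counts = []
    for ch in line[1:80]:
        if ch == " ":
            continue                 # blanks are ignored
        if ch == "=":
            break                    # '=' marks the end of the list
        if ch not in "0123456789":
            raise ValueError("unexpected character: " + ch)
        counts.append(int(ch))       # one digit per system
    return counts


bars = parse_b_line("B 2222 333")
total_bars = sum(bars)               # bars covered before squeezing begins
```

The total for the example is 17, which is the bar count quoted in the text.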


5. COMMENT LINES ARE PRINTED OUT BUT IGNORED. THEY CAN CONTAIN
ANYTHING IN COLUMNS 2-80.


TO DEMONSTRATE ALL THIS, A LEGITIMATE RUN MIGHT BEGIN LIKE THIS...

LP                                            (LAYOUT LINE)
H7 C/SYEEDA'S SONG FLUTE/
H5 R 'JOHN COLTRANE'
X
S 1 1 0 4 4 =                                 (BEGIN MUSIC DATA)


OR LIKE THIS...

LS -1 .002 .20 75                             (LAYOUT LINE)
H8 C 'WAR REQUIEM'                            (BEGIN INIT. LINES)
H6 L 'WILFRED OWEN' R 'BENJAMIN BRITTEN, OP. 66'
H6                                            (TO LEAVE SPACE)
H7 C 'REQUIEM AETURNUM'
F4 C 'COPYRIGHT 1962 BY BOOSEY AND HAWKES'
R4 L 'WAR REQUIEM'
G 122122 222 325
X                                             (END INIT. LINES)
S 1 1 -1 5 4                                  (BEGIN MUSIC DATA)


MUSIC DATA LINES

N.B. THIS SECTION GIVES THE EXACT FORMAT OF SMUT MUSIC-DESCRIBING
COMMANDS. SINCE THESE ARE GENERATED AUTOMATICALLY BY SMIRK
AND BY JANUS, THE PROGRAMS THAT INTERFACE MUSTRAN AND COMPOSING
PROGRAMS TO SMUT, USERS OF SMIRK OR JANUS NEED NOT PAY TOO
MUCH ATTENTION TO THIS SECTION.

C EACH DATA LINE (OR LINE IMAGE) IS DIVIDED INTO FOUR 20-COLUMN
C DIVISIONS. A COMMAND MAY INCLUDE 1, 2, OR 3 OF THESE
C DIVISIONS. COLUMN 1 ALWAYS BEGINS A COMMAND. A '$' OR SEMICOLON
C IN COLUMN 20 INDICATES A NEW COMMAND BEGINS IN 21, AND
C SIMILARLY FOR COLUMNS 40-41 AND 60-61. AN '=' IN COLUMN 20, 40,
C OR 60 INDICATES THE REST OF THE LINE IS TO BE IGNORED.
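How a data line breaks into commands can be sketched in Python. This is an illustration under stated assumptions, not SMUT's reader: it assumes 20-column divisions with the separator in columns 20, 40 and 60 (so that commands start in columns 1, 21, 41 and 61), treats a division boundary with no separator as a command that continues into the next division, and drops all-blank commands, which are no-ops. The function name `split_data_line` and the sample command strings are hypothetical.

```python
def split_data_line(line):
    """Split one data line into its command strings."""
    padded = line.ljust(80)[:80]
    commands = []
    start = 0                            # 0-based index where the current command begins
    for boundary in (20, 40, 60):        # 1-based separator columns
        marker = padded[boundary - 1]
        if marker in "$;":
            commands.append(padded[start:boundary - 1])
            start = boundary             # next command begins in the following column
        elif marker == "=":
            commands.append(padded[start:boundary - 1])
            start = None                 # rest of the line is ignored
            break
    if start is not None:
        commands.append(padded[start:])
    # A blank op code is a no-op, so empty commands are dropped here.
    return [c.rstrip() for c in commands if c.strip()]


# Two one-division commands followed by an ignored remark.
sample = "N 39  0  4".ljust(19) + "$" + "R  0  4".ljust(19) + "=" + "REMARK"
parts = split_data_line(sample)
```

A command occupying two or three divisions simply has no separator at the intermediate boundaries, so it comes back as one string.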

C THE TYPE OF INFORMATION IN A COMMAND AND HOW IT IS FORMATTED IS
C DETERMINED BY THE 1ST CHARACTER OR OP CODE (IN COLUMN 1, 21,
C 41, OR 61), AS FOLLOWS -
C
C   OP CODE  COMMAND                         NO. OF DIVS., FORMAT
C
C   N        NOTE                            (1)4I3,3A2 (2)4I3,5A2
C   G        GRACE NOTE                      (1)4I3
C   R        REST                            (1)3I3,A2
C   B        BARLINE                         (1)I3
C   P        PERFORMANCE DIRECTION           (1)3I3,2A4,A1 (2)3I3,
C                                            7A4,A1 (3)3I3,12A4
C   A        ARTIFICIAL GROUP                (1)3I3
C   S        STAFF-CLEF-KEY SIGNATURE-METER  (1)7I3
C   I        IDENTIFICATION                  (1)I3,4A4 (2)I3,8A4
C   *        CONTROL                         (1)I3,2A4
C   E        EDIT                            ( )I1,I2,A1,I5,2(1X,I2),
C                                            9A5,3A2
C   M        MISCELLANEOUS SYMBOL            (1)I3,A2
C   U        CALL USER ROUTINE               (1)2I3,I4,I3
C   T        OCTAVA                          (1)2I3
C   X        END-OF-VOICE                    (1)A1
C   (BLANK)  NO-OP
C
C   ANYTHING ELSE IS ILLEGAL.
C
C E X P L A N A T I O N   O F   P A R A M E T E R S
C
C N.B. GENERALLY A STATEMENT THAT SOMETHING 'CANNOT' BE DONE
C MEANS IF IT IS, AN ERROR MESSAGE WILL BE ISSUED. A STATEMENT
C THAT SOMETHING 'SHOULD NOT' BE DONE USUALLY MEANS THAT IF IT
C IS, SMUT WILL NOT CATCH IT AND BAD THINGS MAY HAPPEN.
C
C N.B. THE 'I' FORMAT USED HERE IS LIKE FORTRAN 77, NOT FORTRAN
C 66, IN THAT FORTRAN 66 TREATS TRAILING BLANKS LIKE ZEROS,
C WHEREAS SMUT AND FORTRAN 77 IGNORE THEM (WHICH MAKES FAR MORE
C SENSE).





C
C NAME    FORMAT  MEANING
C
C ** NOTE (N) COMMAND **
C
C NOTP    I3    THE MEANING OF NOTP AND JQP DEPENDS ON WHETHER
C               CHROM IS IN EFFECT. IF SO, NOTP IS AN INTEGER,
C               OPTIONALLY SIGNED, REPRESENTING THE NOTE'S PITCH IN
C               SEMITONES RELATIVE TO THE PIANO'S LOW A (NOTP=0).
C               THUS, MIDDLE C IS 39, THE PIANO'S TOP C 87, AND SO
C               ON. LEGAL RANGE IS -9 .LE. NOTP .LE. 99 (AFTER
C               TRANSPOSITION). IF NOCHROM, NOTP GIVES THE
C               NUMBER OF THE STAFF LINE OR SPACE THE NOTE SHOULD
C               APPEAR ON, WITH 29=MIDDLE C (REGARDLESS OF CLEF).
C               HENCE, IN THIS CASE THE PIANO'S LOW A IS 6 AND ITS
C               TOP C IS 57.
C
C JQP     I3    UNDER CHROM, A VALUE OF 1 INDICATES QUARTER-TONE
C               DOWN, 2 QUARTER-TONE UP FROM THE PITCH GIVEN BY
C               NOTP. IF NOCHROM, THE ONE'S DIGIT IS AS JUST
C               DESCRIBED, WHILE TEN'S DIGIT=1 MEANS DOUBLE FLAT, 2
C               MEANS FLAT, 3 MEANS NATURAL, 4 SHARP, AND 5 DOUBLE
C               SHARP.
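The two NOTP numbering schemes and the JQP digits can be checked against the reference values quoted above (low A, middle C, top C) with a short Python sketch. The function names are hypothetical, the octave numbers follow scientific pitch notation (middle C = C4), which this manual does not itself use, and spelling every chromatic pitch with sharps is an arbitrary choice made only for the example. A ten's digit of 0 in JQP is assumed to mean no accidental.

```python
SHARP_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]
LETTERS = "CDEFGAB"
ACCIDENTALS = {1: "double flat", 2: "flat", 3: "natural",
               4: "sharp", 5: "double sharp"}
QUARTER_TONES = {1: "quarter-tone down", 2: "quarter-tone up"}


def chrom_pitch_name(notp):
    """Name of a CHROM pitch: semitones above the piano's low A."""
    if not -9 <= notp <= 99:
        raise ValueError("NOTP outside the legal range -9..99")
    # Low A lies 9 semitones above the C that starts its octave.
    octave, pitch_class = divmod(notp + 9, 12)
    return SHARP_NAMES[pitch_class] + str(octave)


def nochrom_pitch_name(notp):
    """Letter name of a NOCHROM staff position, where 29 is middle C."""
    octave_shift, step = divmod(notp - 29, 7)
    return LETTERS[step] + str(4 + octave_shift)


def decode_jqp_nochrom(jqp):
    """Split a NOCHROM JQP into (accidental, quarter-tone) descriptions."""
    tens, ones = divmod(jqp, 10)
    return ACCIDENTALS.get(tens), QUARTER_TONES.get(ones)
```

Both schemes agree on the three reference notes: 0, 39 and 87 under CHROM name the same pitches as 6, 29 and 57 under NOCHROM.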

C NDURB   I3    DURATION. IF NDURB AND IDT ARE BOTH ZERO, THE
C               DURATION OF THE PREVIOUS NOTE READ IS REPEATED.
C               OTHERWISE, THE MEANING OF NDURB AND IDT DEPENDS
C               ON WHETHER NORHDEC OR RHDEC IS IN EFFECT. WITH
C               NORHDEC, NDURB IS THE RECIPROCAL OF THE BASIC
C               DURATION AND IDT THE NUMBER OF DOTS. WITH RHDEC,
C               NDURB IS A NO. OF 128TH NOTES AND IDT A NO. OF
C               WHOLE NOTES TO ADD TOGETHER TO DETERMINE THE
C               DURATION. BASIC DURATIONS ARE LISTED WITH THEIR
C               EQUIVALENTS BELOW.
C
C                                  WITH NORHDEC        WITH RHDEC
C               NOTE VALUE         NDURB  MAX. OF IDT  NDURB  IDT
C
C               DOUBLE LONG          -8       10          0     8
C               LONG                 -4        9          0     4
C               BREVE                -2        8          0     2
C               WHOLE (SEMIBREVE)     1        7          0     1
C               HALF                  2        6         64     0
C               QUARTER               4        5         32     0
C               EIGHTH                8        4         16     0
C               16TH                 16        3          8     0
C               32ND                 32        2          4     0
C               64TH                 64        1          2     0
C               128TH               128        0          1     0
C
C IDT     I3    SEE EXPLANATION OF NDURB ABOVE.
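The two duration encodings can be compared with a small Python sketch that returns durations in whole notes as exact fractions. The function names are hypothetical; the dot rule used for NORHDEC (each dot adds half the value of the previous one) is the standard notational convention, and the "both zero repeats the previous duration" case is left out.

```python
from fractions import Fraction


def duration_norhdec(ndurb, idt):
    """Duration in whole notes: NDURB = reciprocal of basic value, IDT = dots."""
    if ndurb > 0:
        basic = Fraction(1, ndurb)       # 4 = quarter, 8 = eighth, ...
    elif ndurb < 0:
        basic = Fraction(-ndurb)         # -2 = breve, -4 = long, -8 = double long
    else:
        raise ValueError("NDURB=0 is not a basic duration")
    # n dots multiply the basic value by 2 - 1/2**n.
    return basic * (2 - Fraction(1, 2 ** idt))


def duration_rhdec(ndurb, idt):
    """Duration in whole notes: NDURB 128th notes plus IDT whole notes."""
    return Fraction(ndurb, 128) + idt


dotted_quarter = duration_norhdec(4, 1)
same_in_rhdec = duration_rhdec(48, 0)
```

A dotted quarter is 3/8 of a whole note either way: one dot on NDURB=4 under NORHDEC, or 48 128th notes under RHDEC.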

C KSYM    5A2   EACH OF THE FIVE FIELDS INDICATES A SYMBOL
C               ASSOCIATED WITH THE NOTE. ALL ES'S FOR A NOTE SHOULD
C               APPEAR FIRST, THEN ALL SL'S, SU'S, AND SD'S, THEN
C               A TI (IF PRESENT), THEN EVERYTHING ELSE. ES AND
C               SL CANNOT BOTH APPEAR ON A DECOMPOSED NOTE. IN
C               TERMS OF POSITIONING, SYMBOLS FALL INTO FOUR
C               CATEGORIES - THOSE THAT ARE ALIGNED WITH THE NOTEHEAD
C               AND ALWAYS GO ABOVE THE STAFF (MARKED 'A' BELOW),
C               THOSE THAT ARE ALIGNED WITH THE NOTEHEAD AND CAN GO
C               ABOVE OR BELOW THE HEAD OR STEM (NO MARKING), THOSE
C               THAT FOLLOW THE NOTE (MARKED 'F'), AND MISCELLANEOUS
C               UNALIGNED SYMBOLS (MARKED 'M'). THOSE IN THE 2ND
C               CATEGORY ARE DRAWN WITH THE 1ST INNERMOST (CLOSEST
C               TO THE NOTEHEAD), THEN THE 2ND, ETC.
C
C               CODE  CAT.  MEANING
C
C               ES          END SLUR. UP TO 3 SLURS (AND TIES) MAY
C                           BE NESTED.
C               SL          BEGIN SLUR.
C               SU          BEGIN SLUR CURVING UP WITH CURVATURE
C                           GIVEN BY THE LEFTMOST DIGIT OF THE NEXT
C                           FIELD.
C               SD          BEGIN SLUR CURVING DOWN WITH CURVATURE
C                           GIVEN BY THE LEFTMOST DIGIT OF THE NEXT
C                           FIELD.
C               TI          TIE TO FOLLOWING NOTE.
C               0           INDICATES NUMBER ZERO.
C               1           NUMBER '1'.
C               2           NUMBER '2'.
C               3           NUMBER '3'.
C               4           NUMBER '4'.
C               5           NUMBER '5'.
C               A           AN ACCENT MARK.
C               AH          A HEAVY ACCENT (LIKE FRENCH CIRCUMFLEX).
C                           STACCATO DOT.
C                           TENUTO LINE.
C               V     A     A 'V' (UPBOW SIGN).
C               D     A     A DOWNBOW SIGN.
C               FM    A     FERMATA (HOLD).
C               TR    A     LETTERS 'TR'.
C               TW    A     A WIGGLY LINE (BAROQUE TRILL).
C               MR    A     MORDENT.
C               ,     F     BREATH MARK AFTER NOTE.
C               //    F     PAUSE MARK AFTER NOTE.
C               TM    M     TREMOLO. WHEN A FIELD CONTAINS TM,
C                           THE SUCCEEDING FIELD MUST INDICATE THE
C                           RHYTHM OF THE TREMOLO. THIS IS THE
C                           LEFT-JUSTIFIED NO. OF SLASHES REQUIRED,
C                           SO THAT DIVIDING A NON-DOTTED NOTE
C                           INTO 8 WOULD BE INDICATED BY '3 '. TM
C                           SHOULD NOT APPEAR MORE THAN ONCE ON A
C                           NOTE.
C               GL    F     GLISSANDO. THE NEXT FIELD MUST CONTAIN
C                           THE SEMITONE PITCH TO BE SLID TO,
C                           THE NEXT FIELD AFTER THAT THE QUARTER-
C                           TONE INDICATOR.
C               BM    M     BEGIN BEAMING HERE.
C               EB    M     END BEAMING HERE.

C               REGARDLESS OF WHETHER AUTO-BEAMING IS IN EFFECT
C               (SEE THE '* BEAM' COMMAND), SMUT ATTEMPTS TO BEAM
C               THE NOTES AND GRACE NOTES FROM THE ONE WITH 'BM'
C               THROUGH THE ONE WITH 'EB' WITHOUT BREAKS DUE TO
C               METER. HOWEVER, THE 'BM' AND 'EB' MUST BOTH BE
C               WITHIN THE SAME BAR - BEAMING ACROSS BARLINES IS
C               NOT POSSIBLE.
C
C               H3    M     SQUARE NOTEHEAD.
C               H4    M     DIAMOND-SHAPED NOTEHEAD.
C               H6    M     'X' SHAPED NOTEHEAD.
C               H7    M     OMIT NOTEHEAD ENTIRELY. IF THE NOTE
C                           HAS ACCIDENTALS OR DOTS, THEY ARE NOT
C                           OMITTED.
C               (BLANK)     INDICATES NO SYMBOL.
C
C               N.B. , AND // SHOULD NOT BE SPECIFIED WITH THE
C               SAME NOTE.
C
C ** GRACE NOTE (G) COMMAND **
C
C NOTP    I3    SAME AS IN NOTE COMMANDS.
C
C JQP     I3    SAME AS IN NOTE COMMANDS.
C
C NDURG   I3    SAME AS IN NOTE COMMANDS, EXCEPT THE LONGEST LEGAL
C               DURATION IS A HALF NOTE.
C
C ISHG    I3    (NOT USED IN THIS VERSION.)
C
C ** REST (R) COMMAND **
C
C NOTP    I3    -99 MEANS 'INVISIBLE REST'. OTHERWISE, NOTP IS
C               THE NUMBER OF HALF-SPACES BY WHICH THE REST SHOULD
C               BE MOVED FROM ITS NORMAL POSITION, E.G. -6 WOULD
C               LEAVE A WHOLE REST HANGING FROM THE BOTTOM LINE.
C               ODD VALUES ARE NOT RECOMMENDED, SINCE THEY COULD
C               MAKE WHOLE RESTS LOOK LIKE HALF RESTS AND VICE
C               VERSA.
C
C NDURB   I3    SAME AS IN NOTE COMMANDS, EXCEPT THAT -1 MEANS A
C               CENTERED WHOLE REST FILLING A BAR, REGARDLESS OF
C               METER, IF USED AT THE BEGINNING OF A BAR - ELSE IT
C               CALLS FOR WHATEVER RESTS ARE NEEDED TO COMPLETE
C               THE BAR. ALSO -N, FOR N .GT. 1, MEANS A 'MULTIBAR'
C               REST OF N BARS DURATION - A LOGICAL BARLINE
C               SHOULD APPEAR ON EACH SIDE OF A MULTIBAR REST WITH
C               NOTHING IN BETWEEN. IF DOING A SCORE, IT
C               GENERATES N WHOLE RESTS SEPARATED BY BARLINES. IF
C               DOING PARTS, IT GENERATES A THICK HORIZONTAL BAR
C               WITH THE NUMBER N ABOVE IT.
C
C IDT     I3    SAME AS IN NOTE COMMANDS. IGNORED WITH MULTIBAR
C               RESTS.
C
C               NDURB AND IDT BOTH ZERO CAUSES THE DURATION OF
C               THE LAST REST (NOT NOTE) TO BE USED.
C
C KRSYM   A2    A SYMBOL ASSOCIATED WITH THE REST. THOSE MARKED
C               'F' BELOW FOLLOW THE REST, THOSE MARKED 'A' ARE
C               ALIGNED WITH IT AND ABOVE THE STAFF (THESE SHOULD
C               NOT BE USED WITH MULTIBAR RESTS).
C
C               CODE  CAT.  MEANING
C
C               FM    A     FERMATA (HOLD)
C               ,     F     BREATH MARK AFTER REST
C               //    F     PAUSE MARK AFTER REST
C

C ** BARLINE (B) COMMAND **
C
C KB      I3    IF KB .LT. 0, THE TYPE OF BARLINE SPECIFIED BY
C               THE VALUE OF KB IS DRAWN BUT THE BAR IS NOT
C               LOGICALLY ENDED. IF KB .GE. 0 AND AUTOMATIC
C               BARRING IS IN EFFECT, THE VALUE OF KB SPECIFIES THE
C               TYPE OF THE NEXT AUTOMATICALLY GENERATED BARLINE.
C               IF KB .GE. 0 AND AUTO-BARRING IS NOT IN EFFECT,
C               THE TYPE OF BARLINE SPECIFIED IS DRAWN AND THE BAR
C               IS ENDED.
C
C               VALUES HAVE THESE MEANINGS...
C
C               1 MEANS SINGLE BAR,
C               2 LIGHT DOUBLE BAR,
C               3 HEAVY DOUBLE BAR,
C               4 HEAVY DOUBLE BAR WITH REPEAT SIGN BEFORE IT,
C               5 HEAVY DOUBLE BAR WITH REPEAT SIGN AFTER IT,
C               6 HEAVY DOUBLE BAR WITH REPEAT SIGN BEFORE+AFTER,
C               7 DOTTED BARLINE,
C               8 SHORT, LIGHT SINGLE BAR,
C               0 'INVISIBLE BARLINE' (ACTS LOGICALLY LIKE A
C                 BARLINE, FLUSHES BUFFERS, ETC.)
C
C               N.B. IF DIFFERENT VOICES IN A CONNECTED GROUP
C               HAVE A DIFFERENT BARLINE TYPE, PECULIAR RESULTS
C               ARE LIKELY.
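The three cases for KB can be laid out as a small decision function. The Python sketch below is illustrative only: the names `BARLINE_TYPES` and `interpret_kb` and the action strings are hypothetical, and it assumes that for a negative KB the barline type is given by the absolute value, which the description implies but does not state outright.

```python
BARLINE_TYPES = {
    0: "invisible barline",
    1: "single bar",
    2: "light double bar",
    3: "heavy double bar",
    4: "heavy double bar, repeat sign before",
    5: "heavy double bar, repeat sign after",
    6: "heavy double bar, repeat signs before and after",
    7: "dotted barline",
    8: "short, light single bar",
}


def interpret_kb(kb, auto_barring):
    """Return (barline type, action) for a B command parameter."""
    kind = BARLINE_TYPES[abs(kb)]        # assumption: type taken from ABS(KB)
    if kb < 0:
        return kind, "draw now, bar not ended"
    if auto_barring:
        return kind, "use for next automatic barline"
    return kind, "draw now, bar ended"


mid_bar_dotted = interpret_kb(-7, auto_barring=True)
final_bar = interpret_kb(3, auto_barring=False)
```

A negative value is therefore the way to place a barline symbol in the middle of a bar without affecting bar counting.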

C ** PERFORMANCE DIRECTION (P) COMMAND **
C
C IPARM1  I1    .EQ. 1 INDICATES CONDITIONAL PERFORMANCE DIRECTION,
C               WRITTEN ONLY IF THIS VOICE IS THE TOP OR ONLY
C               VOICE IN A GROUP (SEE INITIALIZATION COMMANDS).
C               .EQ. 0 INDICATES PERFORMANCE DIRECTION ALWAYS TO
C               BE WRITTEN.
C
C IPARM2  I2    ABSOLUTE VALUES 1 TO 5 INDICATE VERY SMALL TO VERY
C               LARGE CHARACTERS. IF IPARM2 IS POSITIVE THEY
C               WILL BE WRITTEN ABOVE THE STAFF, IF NEGATIVE BELOW.
C               SIZE 3 IS BEST FOR MOST PURPOSES EXCEPT DYNAMIC
C               LEVELS ('MP', 'FF', ETC.), FOR WHICH SIZE 4 (AND
C               FONT 1) IS SUITABLE. IF DOING A SCORE, CONDITIONAL
C               PERFORMANCE DIRECTION SIZES WILL BE BOOSTED BY
C               ONE.
C
C IPARM3  I2    THE Y-DISPLACEMENT FROM THE NORMAL POSITION IN
C               HALF-SPACES. THE NORMAL POSITION IS 5 HALF-SPACES
C               ABOVE THE STAFF OR 9 HALF-SPACES BELOW.
C
C IPARM4  I1    FONT (0=NORMAL, 1=DYNAMICS FONT, 2=ITALICS). N.B.
C               ON CDC COMPUTERS, FONT 1 ASSUMES ALL ALPHABETIC
C               CHARACTERS ARE UPPER CASE AND FORCES THEM INTO
C               LOWER CASE.
C
C IPARM5  I2    THE NUMBER OF CHARACTERS TO BE WRITTEN (MAX. 48).
C
C         12A4  THE LITERAL CHARACTERS TO BE WRITTEN, STARTING
C               ABOVE OR BELOW THE NEXT NOTE OR REST.
C
C ** ARTIFICIAL GROUP OR GROUPET (A) COMMAND **
C
C NAG     I3    THE GROUP'S TOTAL DURATION IN TERMS OF ITS BASIC
C               UNIT, ALSO TAKEN AS THE ACCESSORY NUMERAL TO BE
C               PRINTED.
C
C JGP     I3    THE GROUP'S DURATIONAL UNIT, INDICATED IN THE SAME
C               WAY AS NDURB FOR NOTES AND RESTS, EXCEPT THAT IT
C               MUST BE .GE. 1 WITH NORHDEC AND SHOULD BE .LE.
C               128 AND A POWER OF 2 WITH RHDEC.
C
C MOVNAG  I3    .LE. -99 MEANS DO NOT PRINT ACCESSORY NUMERAL.
C               OTHERWISE MOVNAG IS THE Y-DISPLACEMENT FOR THE
C               ACCESSORY NUMERAL, IN HALF-SPACES.
C
C               IN SIMPLE METERS, DURATIONS OF NOTES IN ARTIFICIAL
C               GROUPS ARE ALWAYS REDUCED. IN COMPOUND METERS,
C               THIS DEPENDS ON THE CONTROL COMMANDS SHRINK AND
C               NOSHRINK (WHICH SEE). N.B. IF DESYNC IS ON, NO
C               ARTIFICIAL GROUP STARTING OFF THE BEAT MAY CROSS
C               THE FOLLOWING BEAT. SEE THE DESCRIPTION OF THE
C               'S' COMMAND BELOW.

C ** STAFF-CLEF-KEY SIGNATURE-METER (S) COMMAND **
C
C               THE FIRST 'S' COMMAND FOR EACH VOICE SHOULD SPECIFY
C               CLEF AND TIME SIGNATURE. AN INITIAL KEY SIGNATURE
C               OF NO SHARPS OR FLATS IS ASSUMED.
C
C KIND    I3    IF KIND .NE. 0, START A NEW SYSTEM. KIND .GT. 0
C               MEANS RIGHT-JUSTIFY THE OLD SYSTEM, .LT. 0 MEANS
C               DO NOT. +OR- 1 IMPLIES A CONTINUATION OF THE SAME
C               PIECE, SO THE OLD KEY SIGNATURE WILL BE REPEATED
C               AND/OR CANCELLED AND MEASURE NUMBERS WILL CONTINUE.
C               +OR- 2 IMPLIES STARTING A NEW PIECE, SO THE OLD KEY
C               SIGNATURE WILL BE FORGOTTEN, MEASURE NOS. WILL
C               RESTART AT 1, ETC. +OR- 3 IS THE SAME AS +OR- 2,
C               EXCEPT IT ALSO MEANS START A NEW PAGE.
C               KIND=0 MEANS DO NOTHING.
C
C               N.B. KIND = +OR- 2 OR 3 SHOULD NEVER BE USED IN
C               SCORES. WHEN DOING A SCORE, IF ANY VOICE REQUESTS
C               A NEW SYSTEM (I.E., HAS KIND .NE. 0) ONE WILL BE
C               STARTED. TWO DIFFERENT NONZERO VALUES OF KIND
C               IN THE SAME BAR IN DIFFERENT VOICES SHOULD NEVER
C               BE USED. STARTING A NEW SYSTEM ALWAYS IMPLIES
C               ENDING THE CURRENT BAR.
C
C NUCLEF  I3    INDICATES NEW CLEF AS FOLLOWS - 1 TREBLE, 2
C               SOPRANO, 3 MEZZO-SOPRANO, 4 ALTO, 5 TENOR, 6
C               BARITONE, 7 BASS. 0 MEANS DO NOT CHANGE CLEFS.
C
C KSAX    I3    NEW KEY SIGNATURE, AS FOLLOWS -
C               -7 .LE. KSAX .LT. 0 MEANS CHANGE TO -KSAX FLATS,
C               KSAX=0 MEANS NO CHANGE OF KEY SIGNATURE,
C               0 .LT. KSAX .LE. 7 MEANS CHANGE TO KSAX SHARPS,
C               KSAX .GT. 7 MEANS CHANGE TO 0 SHARPS AND FLATS.
C               FLAT AND SHARP KEY SIGNATURES RESPECTIVELY INVOKE
C               '* FLAT' AND '* NOFLAT' CONTROL COMMANDS.

C MN      I3    THE NUMERATOR OF THE NEW TIME SIGNATURE. IF MN=0,
C               THE TIME SIG. DOES NOT CHANGE AND MD AND MSPEC ARE
C               IGNORED.
C
C MD      I3    THE DENOMINATOR OF THE TIME SIGNATURE.
C               IF MN IS GREATER THAN 3 AND A MULTIPLE OF 3 (6,
C               9, 12, ETC.), THE METER IS CONSIDERED COMPOUND,
C               OTHERWISE IT IS CONSIDERED SIMPLE. IN COMPOUND
C               METERS, 3 OF THE UNITS SPECIFIED BY MD MAKE A
C               BEAT, IN SIMPLE METERS ONE SUCH UNIT MAKES A BEAT.
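The KSAX ranges and the simple/compound rule for MN and MD translate directly into code. The Python sketch below is only an illustration; the function names and the returned tuples are hypothetical, and values of KSAX below -7 are treated as errors because the description does not cover them.

```python
from fractions import Fraction


def interpret_ksax(ksax):
    """Return (count, kind) for a KSAX value, or None for 'no change'."""
    if ksax == 0:
        return None
    if -7 <= ksax < 0:
        return -ksax, "flats"
    if 0 < ksax <= 7:
        return ksax, "sharps"
    if ksax > 7:
        return 0, "none"                 # change to no sharps or flats
    raise ValueError("KSAX below -7 is not described")


def classify_meter(mn, md):
    """Return (kind, beats per bar, beat length in whole notes)."""
    compound = mn > 3 and mn % 3 == 0
    if compound:
        return "compound", mn // 3, Fraction(3, md)   # 3 units make a beat
    return "simple", mn, Fraction(1, md)              # 1 unit makes a beat


twelve_eight = classify_meter(12, 8)
three_four = classify_meter(3, 4)
```

Under this rule 12/8 has four dotted-quarter beats per bar, while 3/4 stays simple because its numerator is not greater than 3.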

C MSPEC   I3    IF MSPEC .NE. 0, THE LOGICAL TIME SIGNATURE IS
C               STILL GIVEN BY MN/MD, BUT WHAT IS WRITTEN ON THE
C               MUSIC IS -
C               IF MSPEC .GT. 0,   THEN MSPEC,
C               IF MSPEC .EQ. -1,  A SLASHED 'C',
C                        .EQ. -2,  A 'C',
C                        .EQ. -3,  NOTHING (BUT SPACE IS LEFT).
C                                  NOTHING AND NO SPACE IS LEFT.
C
C MOVEY   I3    Y-DISPLACEMENT FROM THE NORMAL POSITION OF THE
C               ENTIRE SYSTEM, IN HALF-SPACES. IGNORED IF KIND=0.
C               THIS IS INTENDED FOR USE MOSTLY WITH SINGLE LINE
C               MUSIC. IF, IN A SCORE, SEVERAL VOICES HAVE
C               DIFFERENT NONZERO VALUES, STRANGE RESULTS ARE LIKELY.

** IDENTIFICATION/INDENT (I) COMMAND **

ISIZE   I1    SIZE OF CHARACTERS, 1 (VERY SMALL) TO 5 (VERY
              LARGE). (DEFAULTS IF STAFF IS SHARED, OTHERWISE
              3.)

              Y-DISPLACEMENT FROM THE NORMAL POSITION, IN
              HALF-SPACES. THE NORMAL POSITION IS, IF AN UNSHARED
              STAFF, THE CENTER OF THE STAFF, OR IF A SHARED
              STAFF, JUST ABOVE OR BELOW THE CENTER.

IFONT   I1    FONT. 0=NORMAL, 1=DYNAMICS FONT, 2=ITALICS. (SEE
              NOTE ON DYNAMICS FONT UNDER THE 'P' COMMAND.)

MARG    A1    'M'=LEFT MARGIN CHANGE ('HANGING INDENT'),
              ANYTHING ELSE=INDENT RELATIVE TO CURRENT LEFT MARGIN
              FOR THIS STAFF ONLY.

INDENT  I2    DISTANCE TO INDENT STAFF IN CHARACTERS, .GE. 0.

NOTP    I2    NUMBER OF CHARACTERS IN THE IDENTIFICATION
              (MAXIMUM 32).

ISTRNG  8A4   LITERAL CHARACTERS TO BE WRITTEN AT THE LEFT END
              OF THE CURRENT STAFF. IF INDENT=0 THE STAFF WILL
              BE INDENTED BY SLIGHTLY MORE THAN NOTP
              CHARACTERS. N.B. FOR PROPER RESULTS, THIS COMMAND
              SHOULD BE GIVEN IN THE FIRST MEASURE AFTER AN 'S'
              COMMAND THAT STARTS A NEW SYSTEM. IF MORE THAN
              ONE 'I' COMMAND APPLIES TO A SINGLE STAFF, THE
              LAST ONE WILL BE USED.

** CONTROL (*) COMMAND **

KTLV    I3    IN BEAM, CLEF, TRANSP, RHYTH, ECHO, SET, IFEQ,
              SHARE, TNOW, TLFT, HCROWD, HBASE, HMASH, JUST,
              REPKS COMMANDS, AS DESCRIBED BELOW. ON ALL OTHER
              COMMANDS, IGNORED.

KTLWD   2A4   THE CONTROL WORD, LEFT JUSTIFIED. ONLY THE FIRST
              FOUR CHARACTERS ARE SIGNIFICANT, E.G. 'CHRONIC' IS
              TAKEN TO MEAN 'CHROM'. CONTROL WORDS ARE ($
              INDICATES DEFAULT CONDITIONS, ASSUMED AT THE
              BEGINNING OF EACH VOICE) -

$ DESYNC DECOMPOSE (INTO TWO OR MORE TIED 

NOTES) ALL NOTES THAT WOULD OBSCURE 
THE UNDERLYING RHYTHM OR CROSS THE 
BARLINE. THIS ASSUMES NO MORE THAN 10 
BEATS PER BAR (NOTE THAT 12/8 IS OK, 
SINCE IT'S 4 BEATS PER BAR). 

NODESYNC DO NOT DECOMPOSE NOTES. 

BEAM BEAMING CONTROL, CONSISTING OF EXTENT
OF EACH BEAMABLE AREA (THE TEN'S DIG-
IT) AND AGGRESSIVENESS OF BEAMING (THE
ONE'S DIGIT). EXTENT=0 MEANS EACH BEAM-
ABLE AREA EXTENDS FOR ONE BEAT (AS DE-
TERMINED FROM THE TIME SIGNATURE),
=1 MEANS FOR ONE UNIT OF RHYTHMIC
STRENGTH ONE LEVEL ABOVE THE BEAT,
=2 FOR ONE UNIT OF RHYTHMIC STRENGTH
TWO LEVELS ABOVE THE BEAT (ALMOST AL-
WAYS THE WHOLE BAR). FOR EXAMPLE, IN
5/4 TIME, EXTENT=0 MEANS QUARTER-NOTE
DURATION, =1 ALTERNATE HALF AND DOTTED-
HALF DURATION, =2 THE WHOLE BAR. WITH
AGGRESSIVENESS=1, THESE WOULD BE WRIT-
TEN '* 01BEAM', '* 11BEAM', AND
'* 21BEAM', RESPECTIVELY. IF AGGRESS-
IVENESS=0, NO BEAMING IS DONE, RE-
GARDLESS OF EXTENT. AGGRESSIVENESS=1
MEANS TIMID BEAMING, I.E. ALL SUITABLE
GROUPS OF NOTES AND OF GRACE NOTES
WITHIN EACH BEAMABLE AREA ARE BEAMED
TOGETHER BUT BEAMS ARE BROKEN BY ANY
SYMBOL THAT MIGHT CAUSE INTERFERENCE
(RESTS, CLEF CHANGES, ETC.). AGGRESS-
IVENESS=2 MEANS BEAMS WITHIN AN AREA
ARE BROKEN ONLY WHEN ABSOLUTELY NEC-
ESSARY, SOMETIMES NOT EVEN THEN. (DE-
FAULT=1, I.E. 1-BEAT AREAS AND TIMID)

$ BAR AUTOMATICALLY (ACCORDING TO TIME SIG- 

NATURE) SUPPLY SINGLE BARLINES. 

NOBAR DO NOT SUPPLY BARLINES. 

CLEF CHANGE TO THE BETTER CLEF OF THE TWO 

GIVEN IN THE FIRST TWO DIGITS OF KTLV 
IF THE CURRENT CLEF REQUIRES TOO MANY 
LEDGER LINES. CODES FOR THE CLEFS ARE 
THE SAME AS ON THE 'S' COMMAND. FOR 
EXAMPLE, '*57 CLEF' MEANS USE EITHER 
TENOR OR BASS CLEF. 

$ NOCLEF DO NOT CHANGE CLEFS EXCEPT AS SPECIFIED 
BY S-C-K-M COMMANDS. 

TRANSP TRANSPOSE EVERYTHING KTLV SEMITONES. 

$ FLAT USE FLATS FOR ALL HALF-STEP ACCIDENT- 

ALS. 

NOFLAT USE SHARPS. 

FLAT AND NOFLAT ARE AUTOMATICALLY INVOKED BY 

FLAT AND SHARP KEY SIGNATURES, RESPECTIVELY. IF 

NOCHROM IS IN EFFECT, THEY ARE IGNORED. 

SHRINK IN COMPOUND METERS, COMPRESS DURATIONS

WITHIN ARTIFICIAL GROUPS. 

$ NOSHRINK IN COMPOUND METERS, STRETCH DURATIONS 
WITHIN ARTIFICIAL GROUPS. 

DURATIONS OF NOTES IN ARTIFICIAL 
GROUPS IN SIMPLE METERS ARE ALWAYS 
COMPRESSED. 

RHYTH* MULTIPLY THE DURATIONS OF NOTES AND
RESTS AND DIVIDE DENOMINATORS OF TIME
SIGNATURES BY KTLV (WHICH MUST BE 1,
2, OR 4).

RHYTH/ DIVIDE DURATIONS AND MULTIPLY DENOMI-
NATORS BY KTLV (WHICH MUST BE 1, 2,
OR 4).

RHDEC ACCEPT RHYTHMIC INPUT IN 'DECIMAL' 

FORM. N. B. RHDEC WILL NOT WORK WITH 
NODESYNC . 

$ NORHDEC ACCEPT RHYTHMIC INPUT AS RECIPROCALS 
OF ACTUAL DURATIONS. 

$ CHROM ACCEPT PITCH INPUT IN CHROMATIC FORM.

NOCHROM ACCEPT PITCH INPUT IN NONCHROMATIC 

FORM. 

ECHO READ (E. G. SCAN FOR DATA) THE NEXT 

LINE KTLV TIMES. N.B. THE LINE
ECHOED MAY NOT CONTAIN AN 'X' COMMAND. 

READALT IF CURRENTLY READING FROM THE STANDARD 
READ UNIT, SWITCH TO THE ALTERNATE. IF 
CURRENTLY READING FROM THE ALTERNATE 
READ UNIT, SWITCH TO THE STANDARD. 

SET SET VARIABLE FOR TESTING BY IFEQ TO
KTLV (DEFAULT=999).

IFEQ IF KTLV .NE. THE SET VARIABLE, SKIP
INPUT UNTIL AN END IF IS FOUND. EX-
CEPTION... IF THE SET VARIABLE=999,
NEVER SKIP.

END IF TERMINATE THE RANGE OF AN IFEQ.

SHARE OVERRIDE SHARED STAFF STATUS IN PASS I 

ONLY. KTLV=1 TELLS PASS I THIS IS THE 
UPPER VOICE OF A SHARED STAFF, =-1
THIS IS THE LOWER, =0 THIS STAFF IS 
UNSHARED. 

IF EITHER THE TNOW OR THE TLFT COMMAND IS USED TO 
DECREASE THE ELAPSED TIME IN A MEASURE, TERRIBLE 
RESULTS MAY OCCUR. THEY ARE BEST USED ONLY FOR 
ANACRUSES. 

TNOW SET ELAPSED TIME WITHIN CURRENT BAR TO
(KTLV/NUMBER FOLLOWING 'TNOW' IN I3
FORMAT).

TLFT SET REMAINING TIME WITHIN CURRENT BAR
TO (KTLV/NUMBER FOLLOWING 'TLFT' IN I3
FORMAT).

DEFAULTS FOR THE FOLLOWING CONTROL WORDS ARE SET 
ONLY AT THE VERY BEGINNING OF THE RUN, AND WHAT- 
EVER VALUE IS IN EFFECT AT THE END OF THE DATA IS 
TAKEN TO APPLY THROUGHOUT. IF IT IS NECESSARY TO 
VARY THEM WITHIN A RUN, A USER3 OR USER4 CAN
DO SO. 

$ SYST WHEN A SYSTEM IS FILLED, AUTOMATICALLY 

BEGIN ANOTHER WITH THE SAME CLEFS AND 
KEY SIGNATURES. 

NOSYST DO NOT BEGIN A NEW SYSTEM UNTIL AN 
S-C-K-M COMMAND WITH KIND .NE. 0 IS
FOUND. 

HCROWD SET THE 'IDEAL' HORIZONTAL SPACING TO 
NOTP PERCENT OF THE USUAL VALUE. IF 
THE SHORTEST NOTE IN THE SCORE IS, SAY, 
A QUARTER NOTE, ONE MIGHT SET HCROWD
TO 75 TO AVOID LEAVING SPACE FOR
SHORTER NOTES THAT NEVER OCCUR.

HBASE SET THE BASE FOR 'IDEAL' HORIZONTAL 

SPACING CALCULATION TO 1+(HBASE/1000).
HBASE=0 GIVES CONSTANT SPACING, IN- 
DEPENDENT OF DURATION. HBASE=797 
GIVES APPROXIMATELY LINEAR SPACING, 
PROPORTIONAL TO DURATION. BOTH LOOK 
BAD. REASONABLE VALUES ARE FROM ABOUT 
350 TO 700. ( DEFAULT=520 ) 

TO CLARIFY THE ABOVE TWO COMMANDS, 'IDEAL' SPAC- 
ING IS CALCULATED BY THE EXPRESSION 
CROWD*BASE**(LOG2(DURATION)),

WHERE CROWD=HCROWD/100 AND BASE=1+(HBASE/1000).
THIS IS IDEAL, NOT ACTUAL, SPACE BECAUSE EXTRA 
SPACE IS LEFT WHERE NEEDED FOR ACCIDENTALS, AUG- 
MENTATION DOTS, ETC. , AND THEN RIGHT-JUSTIFICA- 
TION USUALLY AFFECTS SPACING FURTHER. 

HMASH SQUEEZE NOTES (INCLUDING ASSOCIATED 

ACCIDENTALS AND DOTS) TO NOTP PER-
CENT OF THEIR WIDTH. THIS IS DIFF-
ERENT FROM AND, IN A WAY, MORE
DRASTIC THAN CHANGING HCROWD AND
HBASE, WHICH AFFECT ALL SYMBOLS,

BUT ONLY THEIR POSITIONS, NOT THEIR 
SHAPES. 

$ MEASNO PUT MEASURE NUMBERS AT THE LEFT END 
OF EACH SYSTEM. 

NOMEASNO DO NOT PUT MEASURE NUMBERS. 

JUST KTLV=0 MEANS DO NOT RIGHT-JUSTIFY ANY-
THING, =1 MEANS RIGHT-JUSTIFY EVERY-
THING EXCEPT THE LAST SYSTEM (OR LAST
STAFF OF EACH PART, IF DOING PARTS),
=2 MEANS RIGHT-JUSTIFY EVERYTHING.
(DEFAULT=2)

REPKS CONTROLS REPEATING THE OLD KEY SIGNA- 

TURE AT THE BEGINNING OF A STAFF IF IT 
IS CHANGED IN THE FIRST MEASURE OF THE 
STAFF. KTLV=0 MEANS DO NOT REPEAT THE 
OLD KEY SIGNATURE, =1 MEANS DO. NOTE 
THAT IF THE KEY SIGNATURE IS CHANGED 
IN THE MIDDLE OF THE BAR. =0 WILL PRO- 
DUCE INCORRECT NOTATION. (DEFAULTED 

** EDIT (E) COMMAND **

USING EDIT COMMANDS REQUIRES A KNOWLEDGE OF SMUT PASS4 COM-
MAND FORMATS. SEE ALL OF PASS4 SOURCE CODE. THERE ARE FIVE
SUBCOMMANDS...

COMMAND   FORMAT

CHANGE    CRWWFNNNNN
INSERT    IRII XXXXX OO PP NNNNN.. AA..   (N'S AND A'S 9I5, 3A2)
AGAIN     AR
DELETE    DR
PRINT     PR

C THE OVERALL FORMAT IS
C (A1, I1, I2, A1, I5, 2(1X, I2), 1X, 9I5, 3A2).
C
C THE R'S (RECORD NUMBERS TO EDIT) ARE RELATIVE TO THE NEXT
C NONEDITOR COMMAND RECORD, E.G. R=2 MEANS THE RECORD AFTER
C NEXT.
C WW=WORD NO. OF RECORD TO CHANGE... -2=INSTRUMENT NO., -1=X,
C 0=OP CODE, .GT. 0=PARAMETER NO. WW. WW .LT. -2=NO OP.
C F='R' MEANS NNNNN IS RELATIVE TO THE OLD VALUE.
C II=INSTRUMENT NO., XXXXX=X, OO=OPCODE, PP=NO. OF PARAMETERS.
C NNNNN=INTEGER PARAMETER, AA=ALPHANUMERIC PARAMETER. EXCEPT IF
C WW=-1 (ON THE CHANGE COMMAND), NNNNN/1000 IS USED.
C RECORD NUMBERS ARE NOT AFFECTED BY INSERTS AND DELETES, E.G.
C EVEN IF THE FIRST COMMAND IS 'D1' A 'C2' STILL REFERS TO
C THE SECOND FOLLOWING RECORD OF THE FILE. FOR A GIVEN REC-
C ORD NUMBER, THE FOLLOWING RULES APPLY...
C    IF A DELETE COMMAND APPEARS, NOTHING ELSE IS ALLOWED.
C    THIS MEANS THAT ONE CANNOT REPLACE A RECORD BY DOING
C    'D' FOLLOWED BY 'I', BUT MUST USE 'I' ON THE PREVIOUS
C    RECORD, THEN 'D' ON THIS ONE.
C    ANY SEQUENCE OF 'C'S AND 'A'S IN ANY COMBINATION IS LEGAL
C    AND DOES WHAT YOU WOULD EXPECT.
C
C RECORD NUMBERS AND PARAMETERS FOR EDITING CAN BE DETERMINED BY
C 1ST RUNNING SMUT WITH LISTC .GT. 0. REGARDLESS OF LISTC,
C SMUT WILL PRINT BACK RECORDS ALTERED BY 'C', 'A', OR 'I'
C COMMANDS. (SEE SUBROUTINE PASS4.) NOTE THAT CHANGING
C ALMOST ANYTHING IN THE REGULAR INPUT STREAM (E.G., ASKING
C FOR SCORE INSTEAD OF PARTS OR VICE-VERSA, CHANGING THE PAGE
C SIZE, ADDING OR DELETING NOTES OR RESTS, ETC.) MIGHT AFFECT
C THE RECORD NUMBERS OR PARAMETERS, SO THE USER SHOULD BE
C CAREFUL TO HAVE EVERYTHING ELSE SETTLED BEFORE MAKING UP
C EDITOR COMMANDS.
C
C COMMANDS TO THIS EDITOR EFFECTIVELY BYPASS THE ERROR CHECKING
C DONE BY PASSES 1, 2, AND 3 OF SMUT, SO A MISTAKEN EDITOR
C COMMAND CAN CAUSE JUST ABOUT ANYTHING.



C ** MISCELLANEOUS SYMBOL (M) COMMAND **

MOVE    I3   LEAVE (MOVE/8)*(HEIGHT OF STAFF) SPACE BEFORE
             SYMBOL.

KSYM    A2   ONE OF -
             .        BREATH MARK
             //       PAUSE MARK
             (BLANK)  NONE

** CALL USER ROUTINE (U) COMMAND **

IPARM1    I3          1, 3, OR 4 RESPECTIVELY MEAN CALL USER-SUPPLIED
                      SUBROUTINE USER1, USER3, OR USER4 IN PASS I,
                      III, OR IV AT THIS POINT IN PROCESSING THE DATA.
                      THESE ROUTINES CAN THEN PERFORM ANY FUNCTIONS THEY
                      WISH. THIS FEATURE ADDS CONSIDERABLE POWER TO
                      SMUT; HOWEVER, WRITING A USER ROUTINE REQUIRES
                      A FAIRLY DETAILED KNOWLEDGE OF SMUT'S INTERNAL
                      WORKINGS.

IPARM2-4  I3, I4, I3  PARAMETERS TO BE PASSED TO THE USER ROUTINE.

** OCTAVA (T) COMMAND **

NT      I3   + OR - 8, 15, OR 22 MEANS START OCTAVE SIGN NOTATION
             WITH NEXT NOTE. ABSOLUTE VALUE OF 8 MEANS 8VA, 15
             MEANS 15MA, 22 MEANS 22DA. POSITIVE VALUE MEANS
             SOPRA, NEGATIVE BASSA.

             NT=0 OR + OR - 1 MEANS LOCO (END OCTAVE SIGN NOTA-
             TION) IMMEDIATELY.

MOVEY   I3   THE Y-DISPLACEMENT OF THE NUMBER AND DOTTED LINE
             IN HALF-SPACES, RELATIVE TO THE NORMAL POSITION.
             THE VALUE ON THE START OCTAVE NOTATION COMMAND IS
             USED AND THAT ON THE LOCO IS IGNORED.
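
The NT rules above can be sketched in a few lines of Python. This is only an illustration of the stated mapping, not SMUT code: the function name `decode_octava` and the returned strings are hypothetical, and rejecting every other value is an assumption modelled on the "ILLEGAL OCTAVA PARAMETER" message listed later.

```python
def decode_octava(nt):
    """Interpret the NT parameter of the T command as described in the text."""
    # NT = 0 or +/-1 ends octave sign notation immediately ("loco").
    if nt in (0, 1, -1):
        return "loco"
    # The absolute value selects the sign; the sign of NT selects sopra/bassa.
    labels = {8: "8va", 15: "15ma", 22: "22da"}
    if abs(nt) not in labels:
        # Assumption: anything else is rejected, as error N17 suggests.
        raise ValueError(f"illegal octava parameter {nt}")
    direction = "sopra" if nt > 0 else "bassa"
    return f"{labels[abs(nt)]} {direction}"
```

For instance, `decode_octava(-15)` yields the label for a two-octave sign below the staff.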


** END-OF-VOICE (X) COMMAND **

        A1   'X' (I.E., THE COMMAND 'XX') INDICATES END-OF-
             DATA. ANYTHING ELSE MEANS SKIP TO THE NEXT LINE
             AND BEGIN READING DATA FOR THE NEXT VOICE.

PROGRAM OPERATION

USING SMUT MOST EFFECTIVELY REQUIRES SOME UNDERSTANDING OF
ITS INTERNAL WORKINGS. SMUT 2.9 CONSISTS OF FOUR 'PASSES', AS
FOLLOWS...

PASS I READS THE ENTIRE DATA FILE, ARRANGED AS ALL OF VOICE 1,
THEN ALL OF VOICE 2, ETC., UNTIL AN 'XX' COMMAND OR END-OF-FILE
IS FOUND. DEPENDING ON WHAT OPTIONS ARE ENABLED BY CONTROL
COMMANDS, IT PERFORMS VARIOUS 'HIGH-LEVEL' FUNCTIONS - DECIDING
WHICH NOTES TO BEAM TOGETHER, WHEN TO CHANGE CLEFS, HOW TO
SIMPLIFY ('DESYNCOPATE' OR CLARIFY) RHYTHM, APPROXIMATELY WHERE
SLURS AND ACCESSORY NUMERALS SHOULD BE PLACED, WHAT COURTESY
ACCIDENTALS TO USE, ETC. - AND WRITES ITS RESULTS ON A SCRATCH
FILE (LOGICAL UNIT 10), STILL WITH ALL OF EACH VOICE TOGETHER.
THE INFORMATION WRITTEN IS EXACTLY THE SAME, WHETHER A SCORE
OR SET OF PARTS IS CALLED FOR, WITH ONE EXCEPTION - MULTIBAR
RESTS. IF PARTS ARE WANTED, THIS KIND OF 'R' COMMAND IS RE-
PLACED BY WHOLE REST COMMANDS SEPARATED BY BARLINE COMMANDS;
IF A SCORE, THE MULTIBAR REST COMMAND IS KEPT.

IN ORDER TO ALIGN EVERYTHING IN THE SCORE PROPERLY, IF ONE IS 
CALLED FOR, SMUT NEEDS TO EXAMINE EVERYTHING HAPPENING IN BAR 
1 TOGETHER, THEN EVERYTHING IN BAR 2, ETC., SO IN THIS CASE 
PASS II SORTS THE SCRATCH FILE ONTO A SECOND SCRATCH FILE (UNIT 
11). IF A SET OF PARTS IS WANTED, PASS II DOES ABSOLUTELY 
NOTHING EXCEPT PRINT SOME INFORMATION PRODUCED BY PASS I. 
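
The reordering done by Pass II can be pictured with a short Python sketch. It assumes, purely for illustration, that each scratch-file record carries its voice and bar number; the tuple layout and the name `pass2_sort` are hypothetical and do not describe SMUT's actual record format.

```python
def pass2_sort(records, want_score):
    """Reorder Pass I output for Pass III.

    records: list of (voice, bar, payload) tuples in Pass I order,
             i.e. all of voice 1, then all of voice 2, and so on.
    """
    if not want_score:
        # For a set of parts the voice-by-voice order is already what is needed.
        return list(records)
    # For a score, bring everything in bar 1 together, then bar 2, etc.
    # sorted() is stable, so records keep their order within a voice and bar.
    return sorted(records, key=lambda rec: (rec[1], rec[0]))
```

Sorting on (bar, voice) is one simple way to get "everything happening in bar 1 together"; the original program may order records within a bar differently.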

PASS III READS SCRATCH FILE 2, ONE BAR OF SCORE OR ONE BAR OF A 
PART AT A TIME, AND ASSIGNS Y-COORDINATES TO SEVERAL THINGS
THAT COULD NOT BE HANDLED BY PASS I (FOR EXAMPLE, EXACT COORD- 
INATES OF BEAMS AND SLURS). MORE IMPORTANTLY, IT ASSIGNS TEMP- 
ORARY X-COORDINATES TO EVERYTHING - I. E. , IT JUSTIFIES THE BAR 
- AND IT DECIDES WHEN A SYSTEM IS FULL AND A NEW ONE MUST BE 
BEGUN. THIS IS DONE BY THE FOLLOWING METHOD. . . 

1. MAKE A LIST OF ALL 'TIMES' WITHIN THE BAR WHERE ANYTHING
HAPPENS, E.G. A NOTE OR REST IN ANY VOICE STARTS. (THIS
LIST IS SOMETIMES CALLED THE 'RHYTHMIC SPINE'.)

2. ASSIGN 'IDEAL' SPACINGS TO EACH OF THESE TIMES WITH THE
FORMULA IDEAL SPACE = 2.5*CROWD*(BASE**LOG2(TIME)).
CROWD AND BASE ARE SET AS DESCRIBED UNDER THE HCROWD
AND HBASE COMMANDS, ABOVE. THE DEFAULTS ARE 1.0 AND
1.520.

3. IF THIS DOES NOT LEAVE ENOUGH SPACE BETWEEN CONSECUTIVE
SYMBOLS WITHIN ANY VOICE, INCREASE IT AS NECESSARY.

4. IF THE RESULTING BAR WIDTH OVERFLOWS THE CURRENT STAFF AT
ALL AND AUTOMATIC STAFF BREAKING WAS REQUESTED, START A
NEW STAFF. EVEN IF NOT, IF IT OVERFLOWS THE CURRENT
STAFF BADLY, GIVE AN ERROR MESSAGE AND START A NEW STAFF.
IF STARTING A NEW STAFF RESULTS IN THE OLD ONE HAVING NO
MUSIC IN IT, GIVE AN ERROR MESSAGE.
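
Steps 1 to 3 above can be sketched in Python as follows. This is a reading of the description, not the program's code: the function names are hypothetical, 'TIME' in the formula is assumed to mean the interval from one spine entry to the next (in whole notes), and step 3 is simplified to one minimum width per spine entry.

```python
import math

def rhythmic_spine(voices):
    """Step 1: all distinct attack times in the bar, over every voice."""
    return sorted({t for voice in voices for t in voice})

def ideal_spaces(spine, bar_length, crowd=1.0, base=1.520):
    """Step 2: 2.5*CROWD*BASE**LOG2(TIME) for each spine entry.

    Assumption: TIME is the distance to the next spine entry
    (or to the end of the bar for the last one).
    """
    ends = spine[1:] + [bar_length]
    return [2.5 * crowd * base ** math.log2(end - start)
            for start, end in zip(spine, ends)]

def enforce_minimum(spaces, minimum):
    """Step 3 (simplified): never give a symbol less room than it needs."""
    return [max(space, need) for space, need in zip(spaces, minimum)]
```

With the default BASE of 1.520, halving the interval divides the ideal space by 1.52 rather than by 2, which is why short notes get proportionally more room than strictly linear spacing would give them.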

PASS III WRITES RECORDS EACH DESCRIBING ONE MUSICAL SYMBOL ON 
A THIRD SCRATCH FILE (UNIT 12), AND CONTROL INFORMATION ON A 
FOURTH (UNIT 13). THESE FILES ARE IMPORTANT IF A PASS IV 
EDITOR IS USED - SEE BELOW. 



PASS IV READS SCRATCH FILE 3 AND DOES THE ACTUAL PLOTTING. AS
IT GOES, IT MULTIPLIES THE TEMPORARY X-COORDINATES OF EACH
SYMBOL BY AN APPROPRIATE CONSTANT SO THAT THE SYSTEM IS 
STRETCHED TO EXACTLY THE CORRECT LENGTH (THAT GIVEN ON THE 
LAYOUT LINE), UNLESS RIGHT-JUSTIFICATION HAS BEEN TURNED OFF 
GLOBALLY BY A '* JUST' COMMAND OR FOR THIS SYSTEM BY A 'S' 
COMMAND THAT ENDED IT AND BEGAN THE NEXT SYSTEM. 
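
The stretching step amounts to a single scale factor per system. The Python sketch below assumes, for illustration only, that the largest temporary x-coordinate marks the right end of the system; the function name `right_justify` is hypothetical, and the returned factor plays the role that the listing calls XFACT.

```python
def right_justify(xs, system_length, justify=True):
    """Scale temporary x-coordinates so the system fills system_length.

    Returns the scaled coordinates and the scale factor that was used.
    """
    if not justify or not xs or max(xs) == 0:
        # Justification turned off, or nothing to stretch.
        return list(xs), 1.0
    xfact = system_length / max(xs)
    return [x * xfact for x in xs], xfact
```

Because every coordinate is multiplied by the same constant, the relative spacing chosen in Pass III is preserved.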

TWO EDITING VERSIONS OF THE PASS IV INPUT SUBROUTINE (READ4) EXIST. 
THESE ALLOW CHANGING OR DELETING ANYTHING ON UNIT 12 OR ADDING NEW 
RECORDS TO IT BEFORE ANYTHING IS ACTUALLY DRAWN. IF THIS IS DONE 
BY MAKING TWO SEPARATE RUNS OF SMUT IN EDIT MODE, BOTH PASS IV
INPUT FILES - UNITS 12 AND 13 - MUST BE SAVED AFTER THE FIRST RUN
AND AVAILABLE TO THE SECOND. THE SYSTEM FILE NAMES WILL BE IMP-
LEMENTATION-DEPENDENT, BUT ON CDC MACHINES THEY ARE 'ZSMUT3' AND
'ZSMUTX' RESPECTIVELY.


PRINTER LISTING 

IN ADDITION TO PLOTTED OUTPUT, THE PROGRAM PRODUCES A PRINTER LIST- 
ING OF THE DATA PLUS COMMENTS AND ERROR MESSAGES. AMONG THE COM- 
MENTS PRINTED ARE INDICATIONS OF WHERE EACH BAR AND EACH LINE OF 
PLOTTED MUSIC BEGINS. THE LISTING WILL BE SHORTENED CONSIDERABLY 
AND ALL ERROR MESSAGES WILL STILL APPEAR IF LISTC IS .LT. 0 (SEE
LAYOUT LINE). 

IN SOME CASES VARIABLE NAMES WILL APPEAR IN THE LISTING WITH 
THEIR CURRENT VALUES. BESIDES INPUT PARAMETERS AS DESCRIBED ABOVE, 
THESE INCLUDE TNOW (ELAPSED 'TIME' IN WHOLE NOTES SINCE THE BE- 
GINNING OF THE BAR, E. G. 0.250 MEANS 1 QUARTER NOTE INTO THE BAR) 
AND XFACT ( X-COORDINATE SCALING FACTOR, FOR RIGHT-JUSTIFYING THE 
SYSTEM). 


ERROR MESSAGES 

A TYPICAL SMUT ERROR MESSAGE LOOKS LIKE THIS. . . 

N18******COL. 21 AN OCTAVA OF 8 IS ALREADY IN EFFECT.

THE 'N' IN 'N18' INDICATES THIS IS A NONFATAL ERROR. OTHER TYPES
OF ERRORS ARE 'F' (FATAL), 'W' (WARNING - MINOR PROGRAM ERROR OR
POSSIBLE USER ERROR), AND 'S' (A SEVERE PROGRAM ERROR). 'COL. 21'
INDICATES THIS ERROR WAS FOUND WHILE PROCESSING THE FIELD STARTING 
AT COLUMN 21 OF THE PRECEDING LINE. A COLUMN NUMBER IS GIVEN ONLY 
IF THE ERROR IS DETECTED DURING PASS I. IN MOST OTHER CASES, SMUT 
WILL GIVE THE BAR NUMBER AND, IF APPROPRIATE, THE VOICE NUMBER
WHERE THE ERROR WAS DETECTED. 
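
For anyone post-processing a listing, the message layout just described can be picked apart with a regular expression. The Python sketch below assumes that every message starts with a severity letter, a two-digit number and a run of asterisks, with an optional 'COL. n' field; only the sample message above is taken from the manual, and the function name `parse_error` is hypothetical.

```python
import re

# Severity letters as explained in the text.
SEVERITY = {"N": "nonfatal", "F": "fatal", "W": "warning", "S": "severe"}

# Letter, two digits, asterisks, optional "COL. n", then the message text.
_PATTERN = re.compile(r"^([NFWS])(\d{2})\*+(?:COL\.\s*(\d+)\s+)?(.*)$")

def parse_error(line):
    """Split one listing line into code, severity, column and text."""
    match = _PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"not a recognizable error message: {line!r}")
    letter, number, column, text = match.groups()
    return {
        "code": letter + number,
        "severity": SEVERITY[letter],
        "column": int(column) if column else None,  # given only in Pass I
        "text": text,
    }
```

The column is returned as `None` when it is absent, matching the statement that a column number is given only for errors detected during Pass I.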

WHATEVER CAUSED AN 'N' ERROR WILL GENERALLY BE IGNORED (SUBJECT
TO MAXERR - SEE THE DESCRIPTION OF THE LAYOUT LINE PARAMETER) AND
EXECUTION WILL CONTINUE, WHILE THE OCCURRENCE OF AN 'F' OR 'S'
ERROR WILL CAUSE THE PROGRAM TO STOP IMMEDIATELY. IF AN 'S' ERROR
OCCURS, FIRST CHECK WHETHER THERE ARE ANY ERRORS IN THE DATA THAT
WERE NOT DETECTED BY SMUT, AND IF SO, CORRECT THEM. IF THE
ERROR PERSISTS, SEND A FULL (E.G. WITH LISTC .GE. 0, PREFERABLY
WITH LISTC .GE. 4) OUTPUT LISTING PLUS PLOT, OR WHATEVER FRAGMENT
OF A PLOT WAS MADE, TO DONALD BYRD AT THE ADDRESS ABOVE.


REMARKS ON SPECIFIC MESSAGES. . . 

F01— TO CORRECT THE ERROR, REWRITE THE NOTE (REST) AS SEVERAL TIED 
NOTES (SUCCESSIVE RESTS). (AS FAR AS I KNOW, THIS HAS NEVER
OCCURRED. ) 

F02 — THERE ARE 5 POSSIBLE LAYOUT LINE ERRORS, NUMBERED 1-5 AS 
FOLLOWS. 

1   XLN .LT. 12*HS
2   YDELT .LT. 2*HS
3   YH+2.*HS .GT. YABMAX, OR
    YMIN .GE. YH (AT THIS INSTALLATION YABMAX= )
4   RESOL .LT. RESMIN (AT THIS INSTALLATION RESMIN= )
5   HS .LT. 15.*RESOL

THE NONZERO DIGITS IN THE NUMBER AFTER 'TYPES' INDICATE WHICH
ONES ACTUALLY OCCURRED.
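
Reading the 'TYPES' number can be sketched in Python. The sketch assumes that each nonzero digit is itself the number of an error in the list above, which is consistent with the sample message 'TYPES 2040' in the table of messages but is not stated outright; the names `LAYOUT_ERRORS` and `decode_layout_types` are hypothetical.

```python
# The five layout line checks, keyed by their number in the list above.
LAYOUT_ERRORS = {
    1: "XLN .LT. 12*HS",
    2: "YDELT .LT. 2*HS",
    3: "YH+2.*HS .GT. YABMAX, OR YMIN .GE. YH",
    4: "RESOL .LT. RESMIN",
    5: "HS .LT. 15.*RESOL",
}

def decode_layout_types(types):
    """Return the (number, condition) pairs flagged by an F02 'TYPES' value."""
    # Assumption: every nonzero digit names one error; zeros are placeholders.
    digits = {int(ch) for ch in str(types) if ch != "0"}
    unknown = digits - set(LAYOUT_ERRORS)
    if unknown:
        raise ValueError(f"unexpected digits in TYPES value: {sorted(unknown)}")
    return [(n, LAYOUT_ERRORS[n]) for n in sorted(digits)]
```

Under this reading, a value of 2040 reports that conditions 2 and 4 were violated.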

F07 — THE MAXIMUM POSSIBLE NUMBER OF VOICES IN A SCORE IS AN IN- 
STALLATION PARAMETER, IVCMAX. (AT THIS INSTALLATION 
IVCMAX= ) 

F13 — IN ORDER TO JUSTIFY THE SCORE, SMUT MAKES A LIST IN EACH 

BAR (CALLED THE 'SPINE') OF ALL 'TIMES' (E. G. , THE BEGINNING 
OF THE BAR, AN EIGHTH NOTE INTO IT, A TRIPLET QUARTER INTO 
IT, ETC. ) WHEN ANY SYMBOL APPEARS, INCLUDING CLEFS, TIME SIG- 
NATURES, ETC. IF A SCORE IS BEING MADE, THE LIST WILL BE 
FOR ALL VOICES FOR EACH BAR. IN SOME CASES SEVERAL NONOB-
VIOUS ENTRIES MAY APPEAR IN THE LIST. NOW, THIS ERROR MESS-
AGE INDICATES THAT THE MAXIMUM POSSIBLE SIZE OF THE LIST IS
BEING EXCEEDED. 

F16 - SMUT MUST START A NEW SYSTEM, EITHER BECAUSE OF AN 'S' COM-
MAND OR BECAUSE THE CURRENT BAR WILL NOT FIT IN THE OLD SYS-
TEM, BUT IT FINDS THAT THE OLD SYSTEM CONTAINS ZERO BARS.
THIS INDICATES (IN THE 'S' COMMAND CASE) TWO 'S' REQUESTS FOR
A NEW SYSTEM VERY CLOSE TOGETHER, IN THE OTHER CASE THAT THE
CURRENT BAR IS SO LONG THAT IT WILL NOT EVEN FIT IN A WHOLE
SYSTEM BY ITSELF. THE LATTER CAN BE CURED BY INCREASING XLN, DE-
CREASING HS (SEE LAYOUT LINE), OR BY USING THE '* HCROWD'
COMMAND.

F17 — BARLINES MUST COINCIDE IN ALL VOICES. SMUT CHECKS THIS BY 

COMPARING THE DURATION OF EACH MEASURE IN VOICE 1 TO THE DUR- 
ATION OF THAT MEASURE IN EACH OTHER VOICE. DURATIONS ARE IN 
WHOLE NOTES. 
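
The check behind F17 can be illustrated with a short Python sketch. The data layout (one list of bar durations per voice), the tolerance and the name `check_barlines` are assumptions made for the example; the manual says only that each voice is compared with voice 1, bar by bar.

```python
def check_barlines(durations, tolerance=1e-4):
    """Compare bar durations of every voice against voice 1.

    durations[v][b] is the duration, in whole notes, of bar b+1 in voice v+1.
    Returns a list of (bar, voice, duration, voice_1_duration) mismatches.
    """
    reference = durations[0]
    problems = []
    for voice_no, voice in enumerate(durations[1:], start=2):
        for bar_no, (ref, dur) in enumerate(zip(reference, voice), start=1):
            # The tolerance is an assumption, to absorb rounding of triplets etc.
            if abs(ref - dur) > tolerance:
                problems.append((bar_no, voice_no, dur, ref))
    return problems
```

Each mismatch carries the same four facts as the sample F17 message: the bar, the voice, its duration, and the duration found in voice 1.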

N05 - EVEN WITH NOSTAFF, SMUT WILL START A NEW LINE WHEN ONE BE-
COMES FAR TOO CROWDED, I.E. WHEN AT THE END OF A BAR SPACING
WOULD HAVE TO BE REDUCED BY A FACTOR OF 1.5 OR MORE TO MAKE 
THE LINE FIT IN A LENGTH OF XLN MINUS ANY INDENT (SEE THE 
LAYOUT LINE). 

N08 - THE KSYM FOLLOWING THE 'TM' MUST BE A LEFT-JUSTIFIED DIGIT,
0-5.

N30 - 'AREA' HERE MEANS BEAMABLE AREA AS USED IN THE DESCRIPTION OF
THE '* BEAM' COMMAND ABOVE, I.E. ONE BEAT UNLESS CHANGED BY
'* BEAM'.

W02 — SMUT SOMETIMES DOES NOT SPACE NOTES FAR ENOUGH APART TO DRAW 
NICE LOOKING BEAMS, SLURS, AND GLISSANDI ON OR BETWEEN THEM. 

BY THE TIME SMUT NOTICES THE PROBLEM AND ISSUES THIS MESS- 
AGE, IT IS TOO LATE TO CORRECT IT. 

W04 - DECREASING TNOW EVEN SLIGHTLY IS DANGEROUS BECAUSE IT CAN

CONFUSE SMUT'S RHYTHM HANDLING AND, FOR EXAMPLE, PRODUCE ERROR 
SOS. IF DECREASING TNOW RESULTS IN A NOTE OR REST HAVING AN 
ATTACK TIME EARLIER THAN A PREVIOUS NOTE OR REST, ERROR S09 
WILL RESULT. 

W05 — IN TRYING TO DECIDE WHETHER A FRACTIONAL BEAM SHOULD POINT TO 
THE LEFT OR TO THE RIGHT, SMUT CANNOT FIND A REASONABLE RHYTH- 
MIC STRENGTH FOR THE NOTE WITH THE FRACTIONAL BEAM OR FOR ONE 
OF THE ADJACENT NOTES. 

W06 — IN TRYING TO DECIDE WHETHER A FRACTIONAL BEAM SHOULD POINT TO 
THE LEFT OR TO THE RIGHT, SMUT ATTEMPTS TO GROUP THE FRAC- 
TIONAL BEAMED NOTE WITH THE PRECEDING OR FOLLOWING NOTE, BUT 
FINDS ALL THREE NOTES IN THE SAME GROUP.

EXAMPLES OF ALL ERROR MESSAGES FROM SMUT 2.9

NO.  ROUTINE  MESSAGE

F01  DECOMP   COL. 1 RHYTHM TOO COMPLEX FOR DESYNC ROUTINE.
F02  INITSM   LAYOUT LINE ERRORS. TYPES 2040
F03  PASS1    COL. 1 SYNCOPATED ARTIFICIAL GROUP WITH DESYNC IS
              ILLEGAL.
F04  PASS1    COL. 1 ILLEGAL BASIC DURATION FOR ARTIFICIAL GROUP
              JGP= 0
F05  PASS1    COL. 1 AN ARTIFICIAL GROUP IS ALREADY IN EFFECT.
F06  PASS1    COL. 1 AN S COMMAND WITH KIND=1 MUST PRECEDE ALL
              EXCEPT * COMMANDS
F07  INITVC   TOO MANY VOICES IN SCORE.
F08  WRBUF    PASS1 MEMORY OF 100 COMMANDS PER BAR EXCEEDED.
F09  PASS1    COL. 1 DENOMINATOR OF TIME SIGNATURE IS ILLEGAL.
F10  PASS1    COL. 1 ILLEGAL DURATION. AFTER TRANSFORMATION NDURB=
              17
F11  PASS1    COL. 1 ILLEGAL NO. OF DOTS. IDT= 12
F12  PASS3    BAR NO. 17 OVERFLOWS PASS3 MEMORY OF 2300 WORDS.
              IN VOICE 11
F13  PASS3    BAR NO. 17 EXCEEDS PROGRAM CAPACITY OF SO ACTION
              TIMES.
F14  INITSM,  ZERO BARS OF MUSIC DATA WERE FOUND.
     OVER2
F15  JUSTIF   BEAM TABLE SIZE ( B) EXCEEDED IN BAR 17 OF VOICE 3
F16  JUSTIF   SYSTEM BEFORE BAR 17 HAS NOTHING IN IT.
F17  JUSTIF   BAR NO. 17 FOR VOICE 3 HAS DURATION .1250
              (VOICE 1 HAS 1.0000)
F18  PASS1    COL. 1 MULTIBAR REST NOT AT BEGINNING OF A BAR.
              TNOW= .7500
F19  PASS1    COL. 1 PHYSICAL BARLINE FOLLOWED BY LOGICAL BARLINE
F20  JUSTIF   SHARED STAFF CLEF CONFLICT IN BAR 6 VOICES 2
              AND 3
F21  INITSM   LAYOUT LINE MISSING OR WRONG.

N01  PASS1    COL. 1 ILLEGAL OP CODE 'Z'
N02  PCHCON   COL. 1 PITCH OUT OF BOUNDS. AFTER TRANSPOSITION
              NOTP=-105
N03  PASS1,   COL. 1 *5 IS AN ILLEGAL CLEF NUMBER
     PROCST
N04  NOTE1    COL. 1 MORE THAN 3 NESTED SLURS REQUESTED.
N05  JUSTIF   SYSTEM BEFORE BAR 28 IS OVERCROWDED BY A FACTOR OF
              1.615
N06  PROCST   COL. 1 ILLEGAL CONTROL VALUE. NOTP= 8
N07  PROCST   COL. 1 ILLEGAL CONTROL WORD 'SHRAP'
N08  NOTE4    INVALID TREMOLO SLASH COUNT '4-n-' IN VOICE 7
N09  SUPER    INVALID NOTE-ASSOCIATED SYMBOL IN VOICE 7
N10  PASS1    COL. 1 -8 IS AN ILLEGAL KEY SIGNATURE.
N11  PASS1    COL. 1 ILLEGAL CHARACTER COUNT OF 33
N12  USERX    UNIMPLEMENTED USER1 CALLED WITH 10 3 4
N13  PROCG    COL. 1 ILLEGAL DURATION FOR GRACE NOTE. NDURG= 1
N14  DECODE   COL. 1 ILLEGAL DATA IN NUMERIC FIELD, IFLAG=-3
              DATA '4F'
N15  NOTE1    COL. 1 GLISSANDO STARTS AND ENDS ON SAME PITCH
N16  PASS1    COL. 1 LAST BAR INCOMPLETE. TNOW= 1.167
N17  PROCT    COL. 1 ILLEGAL OCTAVA PARAMETER 16
N18  PROCT    COL. 1 AN OCTAVA OF 8 IS ALREADY IN EFFECT.
N19  PASS1    COL. 1 INCOMPLETE ARTIFICIAL GROUP AT END-OF-DATA
N20  PASS1    COL. 1 1 UNFINISHED SLURS AT END-OF-DATA
N21  PASS1    COL. 1 20 IS TOO MANY CHARACTERS FOR THIS SIZE STAFF.
N22  NOTE1    NO SLURS ARE IN PROGRESS.
N23  BEAMCK   BEAMING IN BEATS WITH OVER 99 NOTES OR GRACE NOTES IS
              IMPOSSIBLE.
N24  PASS1    CALL TO USER ROUTINE 0 IS ILLEGAL.
N25  INITSM   TOO MANY LINES OF HEADER AND/OR FOOTER
N26  INITSM   CHARACTER SIZE MUST BE A DIGIT.
N27  INITSM   INITIALIZATION OP CODE '7' IS ILLEGAL.
N28  PLTEXT   ILLEGAL JUSTIFICATION TYPE 'D'. LINE...
              H5 DONALD BYRD'
N29  PLTEXT   NO CLOSING DELIMITER / (OPENING / IN COL. 5)
N30  BEAMCK   MORE THAN 30 BEAMS IN AREA.
N31  PLTEXT   SUBLINE LENGTH OF 8.129 EXCEEDS PAGE WIDTH OF 7.700
              SUBLINE 'SONATA IN D MAJOR, OPUS 10 NO. 3, I, BAR 9-12'
N32  PASS1    UNFINISHED OCTAVA AT END-OF-DATA
N33  INITSM   ILLEGAL CHARACTER 'H' IN BAR OR GROUP LINE
N34  INITSM   GROUPING SPECIFIED FOR OVER 20 VOICES
N35  STRING   STRING BUFFER SIZE EXCEEDED AT CHARACTER 157, STRING..
              THIS WORK, COMPLETED IN SEPTEMBER 1936, WAS FIRST PERF
N36  INITSM   BARS SPECIFIED FOR OVER 60 SYSTEMS.
N37  PROCST   TNOW OR TLFT CANNOT BE SET TO 8/ 0
N38  INITSM   INITIALIZATION OP CODE G IS LEGAL ONLY IN A SCORE.

S01  POSTNR   COL. 1 PROGRAM ERROR***MISSING BARLINE. DEAR, TNOW,
              DFAC 1.0000 1.2500 1.0000
S02  DECODE   COL. 1 PROGRAM ERROR***BAD FORMAT. IFLAG=-3
              DATA '(+13)'
S03  NOTE1,   COL. 1 PROGRAM ERROR***TOO MANY FLAGS. NDURB= 10
     REST1    DLNOTE, DENOTE .3333 .0026
S04  JUSTIF   PROGRAM ERROR*** .063 NOT FOUND IN ACTION TIME LIST,
              BAR 17 OF VOICE 3
S05  PASS3    PROGRAM ERROR***OP CODE 6 HAS 5 PARAMETERS
              INSTEAD OF 3
              BAR 17 OF VOICE 3
S06  SLURCK   PROGRAM ERROR***TRIED TO END 2 SLURS BUT ONLY HAD 1
S07  RFILL    PROGRAM ERROR, POINT 12 (Y=5.6480) ABOVE POINT 1
              (Y=5.5712)
S08  RFILL    PROGRAM ERROR, 3 POINTS IS TOO FEW
S09  JUSTIF   PROGRAM ERROR***BAR 27 OF VOICE 4 HAS DECREASING
              TIMES. 250.000 FOLLOWED BY 62.500

W01  RTEST    COL. 1 WARNING, NOTE AND BAR DURATIONS BOTH .GE. 2
W02  PASS4    WARNING. SLUR LENGTH OF .075 TOO SHORT IN VOICE 3
W03  THARC    WARNING. IMPOSSIBLE ARC REQUEST, 5.682 8.404 5.441
              8.510 .04
W04  PROCST   WARNING, DECREASING TNOW (FROM 0.250 TO 0.063) IS
              DANGEROUS.
W05  LFBEAM   CANT FIND RHYTHMIC STRENGTH FOR FRAC. BEAM DIRECTION,
              0.2794 -99
W06  LFBEAM   ALL THREE NOTES IN GROUP 1 FOR FRAC. BEAM DIRECTION


SAMPLE DATA 


1-30 


LS 0. OOS 20 . 70 

HS C 'SLOTH CANON' 

H4 C 'FROM THE MUSICAL OFFERING, BWV 1079' 

H6 R 'J. S. BACH' 

F5 C 'SMUT 2. 9 SAMPLE PLOT FROM MUSTRAN DATA ' 
X 

* 05HCROWD 

* NOBAR $* NODESYNC 

* NOCHROM = 

SI 1-344 -2= 

PI 4 02 Q(ADAGIO) 

I 00 0 BINVERSUS 

R 2 O 

R S O 

N 33 0 16 0 -- 

N 32 0 16 0 

N 31 0 4 OTI 

B 1 

N 31 0 8 0 = 

X 

* S5HCR0WD 

* NOBAR S* NODESYNC 

* NOCHROM 

S 1 1 -3 4 4 —2 s * 

PI 4 02 B(ADAGIO) 

I 00 01 1RQYAL THEME 

N 29 0 4 OTI 

N 29 0 16 0 

N 28 30 16 0 = 

N 29 O 16 0 

N 30 0 16 0 = 

N 31 0 4 1 

N 32 0 8 O 

B 1 

N 33 0 4 0 

X 

* S5HCR0WD 

* NOBAR NODESYNC 

* NOCHROM = 

S 1 7 -3 4 4 -2= 

PI 4 02 3 (ADAGIO) 

I 00 O 6RECTUS 

r a o 

R 16 0 

N 29 0 32 O 

N 30 0 32 0 

N 31 0 3 1 

N 32 0 16 0 

N 33 0 4 1 » 

R 32 0 = 

N 32 0 32 0 

N 31 0 32 0 = 

XX 



Appendix II 


[Handwritten music examples with their encodings; only the captions below are legible in this copy.]

1. Clef Signatures
Example A: Notes and Rests
8. Time Signatures
9. Metronome Counts
Example B
12. Grace Notes and Trills
13. Tied Notes
13a. Slurs
16. Triplets ("Groupettes")
17. Repetitions (not at bar line) (at bar line)
18. Additions
19. Multiple Staffs and Transposition
on a Single Staff
21. Performance Indications and [illegible]




Index to Definitions

N.B. All of the following are music terms, unless indicated otherwise. The list is not
complete: many more terms are defined in Chapter 2.

accidental: 2.3.2
accessory numeral: 2.3.3.1
amanuensis: 1.2
anacrusis: 2.3.3.2.3
beat: 2.3.3.2.1
casting off: 4.5
cautionary accidental: 2.6, 4.3
collision: 5.2.1
conventional music notation: 2.2.2
cue: 2.3.6.3
enharmonic notation: 3.2.5
fractional beam: 2.3.3.3.2
groupet: 2.3.3.1
hunk: 2.3.0.2.2
image space (computers): 3.3.6
kerning (typesetting): 5.2.1.1
ligature (typesetting): 5.2.1.1
measure: 2.3.3.2
movement: 2.3.1
music setting: 1.2
notation: 2.2.1
note: 2.3.1
object space (computers): 3.3.6
part: 2.3.1
punctuation: 4.5
quad tree (computers): 5.2.1.3
quantisation: 3.2.5
rest: 2.3.1
rhythm clarification: 3.2.5
rhythm fitting: 3.2.5
rhythmic strength: 2.3.3.2, 4.4.1
score: 2.3.1
situ: 2.3.3.1.1
solo: 2.3.6.4
spatial notation: 2.3.3.4
staff: 2.3.2
system: 2.3.1
system break: 3.1
texture: 2.3.6
tutti: 2.3.6.4
voice: 2.3.1, 2.3.6
voice assignment: 3.2.5